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Abstract 



Nieuwenhuis, Oliveras, and Tinelli (2006) showed how to describe enhancements of the 
^ ' Davis-Putnam-Logemann-Loveland algorithm using transition systems, instead of pseu- 

^ docode. We design a similar framework for several algorithms that generate answer sets 

■ for logic programs: SMODELS, SMODELScc, ASP-SAT with Learning (cmodels), and a newly 

QQ ' designed and implemented algorithm sup. This approach to describing answer set solvers 

makes it easier to prove their correctness, to compare them, and to design new systems. 
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■ 1 Introduction 

' Answer Set Programming (ASP) is a methodology commonly used for solving com- 

binatorial search problems (jLifschitz 2008p . In the development of ASP solvers, 
computational ideas behind SAT solvers (jGomes et al. 2008)) play an important 
role. Influence of SAT solvers development on ASP systems is twofold. On the one 
hand, such ASP solvers as ASSAiQ and CMODEL^ follow the so called SAT-based 
approach where a SAT solver is invoked for search, possibly multiple times. On 
the other hand, "native" ASP solvers that implement search procedures specifically 
suited for logic programs often adopt computational techniques from SAT solvers. 

For instance, dl\|1 implements backjumping (jRicca et al. 20061) . ^-^id SMODELScJl dWard and Schlipf 2004[ ) 



extends the answer set solver SMODEL^ by introducing restarts, conflict-driven 



^ |http : //assat . cs . ust . hk/ 1 . 
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backjumping, learning, and forgetting - techniques widely used in SAT solvers. 
The ASP solver SUiH (jLierler 2Q08P implements these features also. 

In this paper our main goal is to show how the "abstract" approach to describ- 
ing SAT solvers proposed in (jNieuwenhuis et al. 2006P can be extended to ASP 
solvers that use these sophisticated features. Usually computation procedures are 
described in terms of pseudocode. In (jNieuwenhuis et al. 20061) . the authors pro- 
posed an alternative approach to describing DPLL-like procedures. They introduced 
an abstract framework that captures what "states of computation" are, and what 
transitions between states are allowed. In this way, it defines a directed graph such 
that every execution of the dpll procedure corresponds to a path in this graph. 
Some edges may correspond to unit propagation steps, some to branching, some to 
backtracking. This allows the authors to model a DPLL-like algorithm by a mathe- 
matically simple and elegant object, graph, rather than a collection of pseudocode 
statements. In (jLierler 2008[) . we extended this framework for describing such ASP 
algorithms as SMODELS, ASP-SAT with Backtracking, and SUP without Learning. In 
this paper, we expand our previous work on abstract answer set solvers to cover 
such features as backjumping and learning (and also forgetting and restart). We 
start by introducing an abstract framework that captures a general mechanism of 
these sophisticated features in ASP solvers. For instance, this framework provides 
the transition underlying the process of learning a clause, but it does not suggest 
which clause shall be learned. Similarly, it provides a general description of back- 
jumping but it does not supply the means for computing a "backjump clause" 
necessary for an answer set solver to perform backjumping. We then enhance this 
abstract framework to capture enough information about a state of computation 
for deriving a backjump clause. 

Usually, DPLL-like procedures implement conflict-driven backjumping and learn- 
ing where a particular learning schema such as, for instance. Decision or Firs- 
tUIP ([Mitchell 2005[) is applied for computing a special kind of a backjump clause. 
There are two common methods for describing a backjump clause construction. 
One employs the implication graph ( [Marques-Silva and Sakallah 1996] ) and the 
other employs resolution (jMitchell 2005^ . Ward and Schlipf (|2004p extended the 
notion of an implication graph to the SMODELS algorithm. They then defined an 
algorithm for computing FirstUIP backjump clauses utilized by SMODELScc to im- 
plement conflict-driven backjumping and learning. In this paper we introduce the 
algorithms Backjump Clause and Backjump ClauseFirstUIP based on resolution and 
the enhanced abstract framework that compute Decision and First!7ff0 backjump 
clauses respectively. 

In (jLierler 2008^ . we introduced the basic algorithm underlining the system SUP 
but neglected some of its features: confiict-driven backjumping, learning, forgetting, 
and restarts. Here we account for these techniques and use an abstract framework 
designed in this paper for describing system SUP. We emphasize that the work 
on this abstract framework helped us to develop ASP solver SUP, to incorporate 



° http://www.cs.utexas.edu/users/tag7sup' . 
The names of the backjump clauses follow l|Mitchell 2005|l . 
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learning into its algorithm, and to prove its correctness. We analyzed performance 
of SUP against such answer set solvers as CMODELS, SMODELS, SMODELS cc, and 
CLASfH. Overall, SUP performs well against these rival systems. 

We start the paper with Section [2] that reviews the abstract DPLL framework 
introduced in (jNieuwenhuis et al. 2006)) and some logic programming concepts. In 
Section[31 we define a graph representing the application of the algorithm for finding 
supporting models of a logic program. This paves the way to defining a graph rep- 
resenting the application of the SMODELS algorithm to a program in Section S) Sec- 
tion 14.21 elaborates on the relationship between previously defined abstract frame- 
works. Section [5] extends the abstract DPLL framework by introducing an addi- 
tional inference rule so that the generate and test algorithm of the SAT-based ASP 
system CMODELS may be characterized by this graph. In Section |6l we review the 
abstract framework that describes DPLL enhanced by backjumping and learning. 
In Section [71 we define a general abstract framework for describing ASP algorithms 
that implement such phenomena as backjumping and learning. In Section 17.21 we 
describe the algorithms of systems SMODELS cc and SUP by means of this framework. 
In Section [8] we extend the abstract generate and test framework to accommodate 
backjumping and learning, and in Section |8^ we use these findings to describe the 
CMODELS algorithm. Section |9] extends the framework to capture additional infor- 
mation about a computation state of a solver, states the correctness results, and 
describes how the frameworks are related to each other. Section \W\ provides the 
proofs for these results. In Section [10.31 and [Til we introduce the algorithms based 
on the extended framework for computing a backjump clause that are important in 
implementing confiict-driven backjumping and learning. In Section[T2lwe introduce 
the concept of an extended graph for the generate and test abstract framework and 
state the correctness results. Section [T3l provides the proofs for these results. At last, 
in Section [M) we provide the experimental analysis that compares performance of 
SUP with other answer set solvers. 

2 Review: Abstract DPLL and Logic Programs 
2.1 Abstract Classical DPLL 

For a set a of atoms, a record M relative to cr is a list of literals over a where 

(i) some literals in M are annotated by A that marks them as decision literals, 

(ii) M contains no repetitions. 

The concatenation of two such lists is denoted by juxtaposition. Frequently, we 
consider a record as a set of literals, ignoring both the annotations and the or- 
der between its elements. A literal I is unassigned by a record if neither / nor its 
complement I belongs to it. 



|http ://«««■ cs .uni-potsdam. de/clasp/ | . 
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Unit Propagate: 

CW I e F and 



M ^ Ml if 



Decide: 



M =^ Mr if 



C C M 



M is consistent and 
/ is unassigned by M 



Fail: 

M is inconsistent and 
M contains no decision literals 



M =4> FailState if 



Backtrack: 



Pl^Q=^ PI if 
Fig. 1. The transition rules of the graph dpj? 



Pl^ Q is inconsistent, and 
Q contains no decision literals 



A state relative to a is either a distinguished state FailState or a record relative 
to a. For instance, the states relative to a singleton set {a} of atoms are 

FailState^ 0, a, -la, a"^, ^a^,a^a, a^^a, 
a-ia^, a^^a^ ,^aa, -^a^ a, -^aa^, ^a^a^, 

where by we denote the empty list. 

If C is a disjunction (conjunction) of literals then by C we understand the con- 
junction (disjunction) of the complements of the literals occurring in C. We will 
sometimes identify C with the multi-set of its elements. 

For any CNF formula F (a finite set of clauses) , we will define its DPLL graph 
DPp- The nodes of BPp are the states relative to the set of atoms occurring in F. 
We use the terms "state" and "node" interchangeably. Recall that a node is called 
terminal in a graph if there is no edge leaving this node in the graph. If a state is 
consistent and complete then it represents a truth assignment for F. 

The set of edges of DPp is described by a set of "transition rules." Each transition 
rule is an expression M M' followed by a condition, where M and M' are nodes 
of DPi?. Whenever the condition is satisfied, the graph contains an edge from node M 
to M' . Generally, an edge in the graph may be justified by several transition rules. 
Figure [T] presents four transition rules that characterize the edges of DPp. 

This graph can be used for deciding the satisfiability of a formula F simply 
by constructing an arbitrary path leading from node until a terminal node M 
is reached. The following proposition shows that this process always terminates, 
that F is unsatisfiable if M is FailState, and that M is a model of F otherwise. 

Proposition 1 

For any CNF formula F, 

(a) graph OPp is finite and acyclic, 
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(b) any terminal state of DPp other than FailState is a model of F, 

(c) FailState is reachable from in DPi? if and only if F is unsatisfiable. 

For instance, let F be the set consisting of the clauses 

a V & 
-ifl V c. 

Here is a path in DPp: 

=^ (Decide) 

a^ [Unit Propagate) 

a^ c [Decide) 
a^cb^ 

The name of the transition rule after each => shows which rule justifies the presence 
of this edge in the graph. Since the state a^ c is terminal, Proposition [Ijb) 
asserts that {a, c, b} is a model of F. Here is another path in DPi? from to the 
same terminal node: 






{Decide) 


a^ 


{Decide) 


a^^c^ . 


( (7nii Propagate 


a^^c^c = 


{Backtrack) 


a^c 


=4> {Decide) 


a^cb^ 





(2) 



Path ([T]) corresponds to an execution of dpll in the sense of (jPavis et al. 1962| : 
path ([2]) does not, because it applies Decide to a'^ even though Unit Propagate 
could be applied in this state. 

Note that the graph T)Pp is a modification of the classical DPLL graph defined 
in (jNieuwenhuis et al. 20061 Section 2.3). It is different in three ways. First, its 
states are pairs M||f for all CNF formulas F. For the purposes of this section, 
it is not necessary to include F . Second, the description of the classical DPLL 
graph involves a "PureLiteral" transition rule. We dropped this rule because it 
does not correspond to any of the propagation rules used in answer set solvers 
whose algorithms we will model in this paper. Third, in the definition of that 
graph, each M is required to be consistent. In case of DPLL, due to the simple 
structure of a clause, it is possible to characterize the applicability of Backtrack in 
a simple manner: when some of the clauses become inconsistent with the current 
partial assignment. Backtrack is applicable. In ASP, it is not easy to describe the 
applicability of Backtrack if only consistent states are taken into account. We 
introduced inconsistent states in the graph dp p to facilitate our work on extending 
this graph to model algorithms of answer set solvers. 

In the rest of this section we give a proof of Proposition [T] 

Lemma 1 

For any CNF formula F and any state . . . /„ reachable from in DPi? , every 
model X of F satisfies k if it satisfies all decision literals with j < i. 



(1) 
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Proof 

By induction on the path from to Zi . . . . The property of X that we need to 
prove triviahy holds in the initial state 0, and we will prove that all transition rules 
of DPp preserve it. 

Take a model X of F, and consider an edge M =4> M' where M is a list li . . . Ik 
such that X satisfies k if it satisfies all decision literals Ij^ with j < i. 

It is clear that the rule justifying the transition from M to M' is different from 
Fail. For each of the other three rules, M' is obtained from a prefix of M by append- 
ing a list of literals containing at most one decision literal. Due to the inductive 
hypothesis, it is sufficient to show that if X satisfies all decision literals in M' 
then X satisfies all M' . 

Unit Propagate: M' is Ml. By the inductive hypothesis, for every literal in M 
the property in question holds. We need to show that X \= I. From the definition of 
Unit Propagate, for some clause CV I & F, C C M. Consequently, M \= ^C. From 
the inductive hypothesis and the assumption that X satisfies all decision literals 
in M' and hence in M, it follows that X \^ M. Since X is a model of F, we conclude 
that X \= I. 

Decide: M' \s Ml^. Obvious. 

Backtrack: M has the form Pl^Q where Q contains no decision literals. M' 
is PI. By the inductive hypothesis, it trivially follows that for every literal in P 
the property in question holds. We need to show that X \^l. Assume that X \=l. 
Since Q does not contain decision literals, and the assumption that X satisfies all 
decision literals in M' and hence in P, X satisfies all decision literals in Pl^ Q, that 
is M. By the inductive hypothesis, it follows that X satisfies M. This is impossible 
because M is inconsistent. □ 

iProof of Proposition [T] 

(a) The finiteness of dp^- is obvious. For any list of literals by \N\ we denote the 
length of N. Any state M other than FailState has the form MqI^Mi . . .l^Mp, 
where l^ . . .1^ are all decision literals of M; we define a{M) as the sequence of 
nonnegative integers \Mq\, |Mi|, . . . , |Mj,|, and a{FailState) — oo. By the definition 
of the transition rules defining the edges of DP^;' , if there is an edge from a state M 
to M' in DPp then a{M) < a(M'), where < is understood as the lexicographical 
order. It follows that if a state M' is reachable from M then a{M) < a{M'). 
Consequently the graph is acyclic. 

(b) Consider any terminal state M other than FailState. From the fact that Decide 
is not applicable, we conclude that M has no unassigned literals. Since neither 
Backtrack nor Fail is applicable, M is consistent. Consequently M is an assignment. 
It follows that for any clause C V I ^ F ii'C % M then C n M ^ 0. Furthermore, 
since Unit Propagate is not applicable, we conclude that if C C M then / S M . 
Consequently, M ^ C V L Hence M is a model of F. 

(c) Left-to-right: Since FailState is reachable from 0, there is an inconsistent state M 
without decision literals that is reachable from 0. By Lemma [H any model of F 
satisfies M. Since M is inconsistent we conclude that F has no models. 
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Right-to-left: From (a) it follows that there is a path froni to some terminal 
state. By (b), this state cannot be different from FailState, because F is unsatisfi- 
able. □ 

2.2 Logic Programs 

We consider programs consisting of finitely many rules of the form 

a ^ bi, . . . ,bi,not bi+i, ... ,not b,n (3) 

where a is an atom or symbol _L, and each 6^ (1 < i < m) is an atom. We will 
identify the body of ([3|) with the conjunction 

6i A . . . A 6; A -.6;+! A . . .-. A &™ (4) 

and also with the set of its conjunctive terms. If the head a of a rule ^ is an atom 
then we will identify ^ with the clause 

a V -.61 V . . . V -.6; V V . . . V 6,„. (5) 

If a is ± then we call rule ([3]) a constraint and identify ([3]) with the clause 

^&iV...V-&; V&,+i V...V&™. (6) 

We will often omit the symbol _L when referring to a constraint. 
We will use two abbreviated forms for a rule ([3|): The first is 

a^ B 

where B stands for 61, . . . , 6j, not 6i+i, . . . , not bm- The second abbreviation is 

a^D,F (7) 

where D stands for the positive part of the body 61 , . . . , 6; , and F stands for the 
negative part of the body not . . . , not bm- 

The reduct of a program 11 with respect to a set X of atoms is obtained 
from n by 

• removing each rule ([7]) such that F n X 7^ 0, and 

• replacing each remaining rule ([71) by a -s— D. 

A set X of atoms is an answer set for a program 11 if X is minimal (with respect to 
set inclusion) among the sets of atoms that satisfy the reduct 11"^ (jGelfond and Lifschitz 1988| . 
For example, let 11 be the program 

a ■<— not b 

b <— not a , . 

c <- a ^ ^ 

d ^ d. 

Consider set {a, c}. Reduct 11'^°''^^ is 
a <— 

c^ a (9) 
d ^ d. 
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Set {a, c} satisfies the reduct and is minimal, hence {a, c} is an answer set of H. 
Consider set {a, c, d}. The reduct n^"''^''^^ is ([9]). Set {a, c, d} satisfies the reduct 
but is not minimal and hence it is not an answer set of 11. 

By Bodies{Il, a) we denote the set of the bodies of all rules of 11 with head a. 
For any set M of literals, by M+ we denote the set of positive literals from M. For 
any consistent and complete set M of literals (that is, an assignment), if M+ is an 
answer set for a program 11, then M is a model of H. Moreover, in this case M is 
a supported model of 11, in the sense that for every atom a G M, M ^ B for some 
B G BodiesiU, a). 

A set U of atoms occurring in a program 11 is said to be unfounded (jVan Gelder et al. 199ip 
on a consistent set M of literals w.r.t. 11 if for every a G U and every B G 
BodiesiU, a), 5 n M ^ or U n B+ ^ There is a tight relation between un- 
founded sets and answer sets: For any model M of a program 11, is an answer 
set for n if and only if M contains no non-empty subsets unfounded on M w.r.t. 11 
(Corollary 2 from (jSacca and Zaniolo 1990 )1^). 

For instance, let 11 be program ([8]) and let M be a consistent set {a, ^b, c, d} of 
literals. We already demonstrated that M+ = {a, c, d} is not an answer set of 11. 
Accordingly, its subset {d} is unfounded on {a, c, d} w.r.t. 11, because the only 
rule in 11 with d in the head 

is such that U n B+ = {d} n {d} ^ 0. 

We say that a program 11 entails a formula F when for any consistent and com- 
plete set M of literals, if M"*" is an answer set for H, then M F. For instance, 
any program 11 entails each rule occurring in 11. 



3 Generating Supported Models 

In Section |4] we will define, for an arbitrary program 11, a graph SMn representing 
the application of the SMODELS algorithm to 11; the terminal nodes of SMn are 
answer sets of H. As a step in this direction, we describe here a simpler graph 

ATLEASTn- 



3.1 Graph ATLEASTn 

The terminal nodes of ATLEASTn are supported models of 11. The transition rules 
defining ATLEASTn are closely related to procedure Atleast (jSimons 2000| Sec- 
tions 4.1), which is one of the core procedures of the SMODELS algorithm. 

The nodes of ATLEASTn are the states relative to the set of atoms occurring in 11. 
The edges of the graph ATLEASTn are described by the transition rules Decide, Fail, 
Backtrack introduced in Section [2.1l and the additional transition rule£0 presented 

^ The Corollary 2 from JS acca and Zaniolo 1990|l refers to "assumption sets" rather than un- 
founded sets. But as the authors noted, in the context of this corollary the two concepts are 
equivalent. 

The names of some of these rules follow ijWard 2004[l . 
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Unit Propagate LP: 

a ^ B en and 



M =^ M a if 



B C M 



All Rules Cancelled: 

M ^ M ^a if B n M for all 5 e Bodies{n, a) 



Backchain True: 



M ^ Ml if 



Backchain False: 



a^ B eU, 
a e M, 

WnM ^0ioT all B' e Bodies{n, a) \ {B} , 
I e B 



a^ i,B e n, 

M =^ M I if i -na e M or a = ±, 
B C M 

Fig. 2. The additional transition rules of the graph ATLEASTn- 



in Figure [2] Note that each of the rules Unit Propagate LP and Backchain False 
is similar to Unit Propagate: the former corresponds to Unit Propagate on C V Z 
where / is the head of the rule, and the latter corresponds to Unit Propagate on 
C V I where I is an element of the body of the rule. 

This graph can be used for deciding whether program 11 has a supported model 
by constructing a path from to a terminal node: 

Proposition 2 

For any program 11, 

(a) graph ATLEASTn is finite and acyclic, 

(b) any terminal state of ATLEASTn other than FailState is a supported model 

of n, 

(c) FailState is reachable from in ATLEASTn if and only if 11 has no supported 
models. 

For instance, let 11 be program ([8|). Here is a path in ATLEASTn: 

=^ {Decide) 

a'^ => {Unit Propagate LP) 

a^ c =^ {All Rules Cancelled) (10) 

a^ c^b => {Decide) 

a^c^bd^ 

Since the state a^ c^bd^ is terminal. Proposition [2l^b) asserts that {a, c,^6, d} is 
a supported model of H. 

The assertion of Proposition [2] will remain true if we drop the transition rules 
Backchain True and Backchain False from the definition of ATLEASTn. 

In the rest of this section we give a proof of Proposition [2] 
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Lemma 2 

For any program IT and any state . . . Z„ reachable from in ATLEASTn , every 
supported model X for 11 satisfies k if it satisfies all decision literals Ij^ with j < i. 

Proof 

By induction on the path from to /i . . . Z„ . Similar to the proof of Lemma [U We 
will show that the property in question is preserved when the transition from M 
to M' is justified by any of the four new rules. 

Take a supported model X for 11, and consider an edge M =4> M' where M is a 
list li . . . Ik such that X satisfies k if it satisfies all decision literals 1^^ with j < i. 

Assume that X satisfies all decision literals in M'. 

Unit Propagate LP: M' is Ma. By the inductive hypothesis, for every literal in M 
the property in question holds. We need to show that X \= a. By the definition of 
Unit Propagate LP, B C M ior some rule a ^ B. Consequently, M \= B. From the 
inductive hypothesis and the assumption that X satisfies all decision literals in M' 
and hence in M, it follows that X |= M. Since X is a model of 11 we conclude 
that X \= a. 

All Rules Cancelled: M' is M^a and "B n M 7^ for every B e Bodies{U, a). 
Consequently, M \= -iB for every B e Bodies (H, a). By the inductive hypothesis, 
for every literal in M the property in question holds. We need to show that X \= -la. 
By contradiction. Assume that X \^ a. From the inductive hypothesis and the 
assumption that X satisfies all decision literals in M' and hence in M, it follows 
that X \= M. Since M |= -.B for every B e Bodies{U, a), it follows that X \= ^B. 
We conclude that X is not a supported model of 11. 

Backchain True: M' is Ml. By the inductive hypothesis, for every literal in M 
the property in question holds. We need to show that X \^ I. By contradiction. 
Assume X \= I. Consider the rule a ^ B corresponding to this application of 
Backchain True. Since I G B , X \= -iB. By the definition of Backchain True, 
WnM ^9 ior every B' in Bodies{Il, a) \ B. Consequently, M \= -^B' for every B' 
in Bodies (n, a) \ B. From the inductive hypothesis and the assumption that X 
satisfies all decision literals in M' and hence in M, it follows that X \= M. We 
conclude that X ^ -^B' for every B' in Bodies (11, a)\B. Hence X is not supported 

by n. 

Backchain False: M' is Ml. By the inductive hypothesis, for every literal in M the 
property in question holds. We need to show that X \=l. By contradiction. Assume 
that X \^ I. By the definition of Backchain False there exists a rule a /, B in 11 
such that G M and B C M. Consequently, M \= ^a and M ^ B. From the 
inductive hypothesis and the assumption that X satisfies all decision literals in M' 
and hence in M, it follows that X \^ M. We conclude that X \^ ^a and X ^ B. 
From the fact that X ^ I, it follows that X does not satisfy the rule a ^ I, B, so 
that it is not a model of IT. □ 

iProof of Proposition [2] 

Parts (a) and (c) are proved as in the proof of Proposition [U using Lemma [H 
(b) Let M be a terminal state so that none of the rules are applicable. From the 
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fact that Decide is not applicable, we conclude that M assigns all literals. Since 
neither Backtrack nor Fail is applicable, M is consistent. Consequently, M is an 
assignment. Since Unit Propagate LP is not applicable, it follows that for every 
rule a ^ B (= U, if B C M then a e M. Consequently, if M ^ 5 then M \= a. 
We conclude that M is a model of 11. We will now show that M is a supported 
model of 11. By contradiction. Suppose that M is not a supported model. Then, 
there is an atom a 6 M such that M ^ B for every B G Bodies(n, a). Since M 
is consistent, B n M ^ ^ for every B e Bodies{Il, a). Consequently, All Rules 
Cancelled is applicable. This contradicts the assumption that M is terminal. □ 

The fact that the assertion of Proposition[2]remains true if we drop the transition 
rules Backchain True and Backchain False from the definition of ATLEASTn follows 
from the proof of Proposition [2] (b) that does not refer to those rules. 



3.2 Relation between DPp and ATLEASTn 

It is well known that the supported models of a program can be characterized as 
models of program's completion in the sense of (jClark 1978^ . It turns out that the 
graph ATLEASTn is identical to the graph dp p , where F is the (clausified) comple- 
tion of n. To make this claim precise, we first review the notion of completion. 

For any program 11, its completion consists of 11 and the formulas that can be 
written as 

-aV y B (11) 

BeBodies{n.a} 

for every atom a in 11. \CNF — Compiji) is the completion converted to CNF us- 
ing straightforward equivalent transformations. In other words, iCNF — Comp{Jl) 
consists of clauses of two kinds: 

1. the rules a <— i? of the program written as clauses 

aV5, (12) 

2. formulas (flTt converted to CNF using the distributivity of disjunction over 
conjunctiorti^. 

Proposition 3 

For any program 11, the graphs ATLEASTn and cNF-Comp{n) are equal. 

For instance, let 11 be the program 

a <^ b, not c , , 

b. ^ ' 

Its completion is 

(a o 6 A -.c) A & A -.c, (14) 



It is essential that repetitions are not removed in the process of clausification. For instance, 
iCNF — Comp(a <— not a) is the formula (a V a) A {—•a V —•a). 
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and iCNF - Comp{n) is 

(a V ^6 V c) A (-.a V &) A {^a V ^c) A 6 A -.c. (15) 

Proposition [3] asserts that ATLEASTn coincides with OP cj^p_co^p(Yiy 

From Proposition |3l it foUows that applying the Atleast algorithm to a program 

essentially amounts to applying DPLL to its completion. 
In the rest of this section we give a proof of Proposition |3l 

It is easy to see that the states of the graphs ATLEASTn and op cNF-CompCn) 
coincide. We will now show that the edges of ATLEASTn and DPc^^_co„p(n) coincide 
also. 

It is clear that there is an edge M M' in ATLEASTn justified by the rule 
Decide if and only if there is an edge M =4> M' in DPcwf-Comp(n) justified by 
Decide. The same holds for the transition rules Fail and Backtrack. 

We will now show that if there is an edge from a state M to a state M' in the 
graph DPp/yp^_p(,„p(n) justified by the transition rule Unit Propagate then there is 
an edge from M to M' in ATLEASTn. Consider a clause C V I e iCNF — Comp (11) 
such that C C M . We will consider two cases, depending on whether C V I comes 
from (ini or from the CNF of (fTT|) . 

Case 1. C y I is aV B corresponding to a rule a -(^ B. 

Case 1.1. I is a. Then there is an edge from M to M' in ATLEASTn justified by 
the transition rule Unit Propagate LP. 

Case 1.2. I is an element of B. Then B has the form 7, D and C is aW D. From 
C C M we conclude that D C M and -^a € M. There is an edge from M to M' in 
the graph ATLEASTn justified by the following instance of Backchain False: 

( a ^1,D e n, 
M M Mf < -^ae M, 
[ D C M. 

Case 2. C V Z has the form -^aW D, where D is one of the clauses of the CNF of 

V ^ 

BeBodies{n,a) 

Then D has the form 

V f(B) 

BeBodies{n,a) 

where / is a function that maps every B G Bodies{Il, a) to an element of B. 

Case 2.1. I is -.a. Then C is D, so that 15 <Z M . Consequently f{B) e B n I? C 
'B n M, so that B n M ^ for every B e Bodies{Il, a). There is an edge from M 
to M' in ATLEASTn justified by All Rules Cancelled. 

Case 2.2. I is an element of D. From the construction of £), it follows that / = 
f{B) G B for some rule a ■(^ B. Then C is 

-aV V f{B'). 

B'eBodies{n,a)\B 

From C C M we conclude that a G M and that f{B') G M for every B' G 
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Bodies{Il, a)\B. Since f{B') is a conjunctive term of B', it follows that B'CiM ^ 0. 
Then there is an edge from M to M' in atleasth justified by Backchain True. 
We will now show that if there is an edge from a state M to a state M' in 

the graph ATLEASTn justified by one of the transition rules Unit Propagate LP, 
All Rules Cancelled, Backchain True, and Backchain False then there is an edge 
from M to M' in BPcNF-Comp{n)- 

Case 1. The edge is justified by Unit Propagate LP. Then there is a rule a ^ B G Tl 
where B C M, and M' is Ma. By the construction of iCNF - Comp{Il), a V S G 
iCNF — Comp{H). There is an edge from M to M' in ^PcNF-Compin) justified by 
the following instance of Unit Propagate: 



M =^ Ma if 



S V a e iCNF - Comp{U) and 
B CM. 



Case 2. The edge is justified by All Rules Cancelled. By the definition of All 
Rules Cancelled, there is an atom a such that for all B <E Bodies(n, a), B r\ M ^ %; 
and M' is M^a. Consequently, M contains the complement of some literal in B. 
Denote one of such literals by f{B), so that f{B) e M. Prom the construction of 
iCNF - Comp{n), 

-aV V f{B) 

BeBodies(n,a) 

belongs to iCNF — Comp{Il). By the choice of /, 



BeSodies(n,a) 



There is an edge from M to M' in bp cNF-Co7np(n) justified by the following instance 
of Unit Propagate: 



M =^ M^a if i 



V e iCNF - Comp{U), 

BeBodies(JI,a) 



V f{B)CM. 

BeBodies{n,a) 



Case 3. The edge is justified by Backchain True. By the definition of Backchain 
True, there is a rule a ^ B € H and a literal I G B such that a € M; for all 
B' G Bodi esjn, a) \ 5' n M ^ 0; and M' is Ml. Let f{B') be an element of B' 
such that f{B') G M. Prom the construction of iCNF - Comp{n), 

-aV/V V f{B') 

B'eBodies{n,a)\B 

belongs to iCNF — Comp (11). By the choice of /, 



V fiB') C M. 

B'eBodies{n,a)\B 



There is an edge from M to M' in p>p cNF-Comp(n) justified by the following instance 
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of Unit Propagate: 



M 



Ml if <^ 



-naVlV y f{B') eiCNF - Comp(n), 

B'eBodies{n,a)\B 

haw V f{B'))CM. 

B'eBodies{n,a)\B 

Case 4. The edge is justified by Backchain False. By the definition of Backchain 
False, there is a rule a <- /, B e IT such that -la e M, B C M, and M' is M7. 
By the construction of iCNF - Comp{U), aVBvle iCNF - Comp{Il). There 
is an edge from M to M' in op cNF-Comp{n} justified by the following instance of 
Unit Propagate: 



M 



M I if 



ay By I iCNF - Comp{n) and 
aVB C M. 



□ 



4 Answer Set Solver Smodels 

4-1 Abstract Smodels 

We now describe the graph SMn that represents the application of the SMODELS 
algorithm to program H. SMn is a graph whose nodes are the same as the nodes 
of the graph ATLEASTn- The edges of SMn are described by the transition rules of 
ATLEASTn and the additional transition rule: 

Unfounded: 

M M ( M is consistent, and 

\ a G [/ for a set U unfounded on M w.r.t. IT. 

This transition rule of SMn is closely related to procedure Atmost (jSimons 20001 
Sections 4.2), which together with the procedure Atleast forms the core of the 
SMODELS algorithm. 

The graph SMn can be used for deciding whether program IT has an answer set 
by constructing a path from to a terminal node: 

Proposition 4 

For any program IT, 

(a) graph SMn is finite and acyclic, 

(b) for any terminal state M of SMn other than FailState, M+ is an answer set 

of n, 

(c) FailState is reachable from in SMn if and only if 11 has no answer sets. 

To illustrate the difference between SMn and ATLEASTn, assume again that IT is 
program Path pO|) in the graph ATLEASTn is also a path in SMn. But state 
a'^c-'bd^, which is terminal in ATLEASTn, is not terminal in SMn- This is not 
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surprising, since {a, c, ^6, d}+ = {a, c, rf} is not an answer set of 11. To get to a 
state that is terminal in SMn, we need two more steps: 

a^c^bd^ (Unfounded, U — {d}) j-^g^ 

a^c^bd^^d (Backtrack) 
a^ c^b^d 

Proposition [DJb) asserts that {a, c} is an answer set of H. 

The assertion of Proposition[5]will remain true if we drop the transition rules All 
Rules Cancelled, Backchain True, and Backchain False from the definition of SMn- 

In the rest of this section we give a proof of Proposition S) 

We say that a model M of a program 11 is unfounded-free if no non-empty subset 
of M is an unfounded set on M w.r.t. 11. 

Lemma 3 (Corollary 2 from ^Saccd and Zaniolo 1990\} ) 

For any model M of a program 11, M+ is an answer set for 11 if and only if M is 
unfounded-free . 

Lemma 4 

For any unfounded set U on a consistent set M of literals w.r.t. a program 11, and 
any assignment X, ii X \= M and X D U 7^ 0, then X+ is not an answer set for H. 

Proof 

Assume that X+ is an answer set for 11. Then X is a model of 11. By Lemma [3l 
it follows that X is unfounded-free. Hence any non-empty subset of X including 
X n U is not unfounded on X. This means that for some rule a ^ i? in 11 such 
that a e X n f/, S n X and X n [/ n 5+ = 0. From X ^ M (M C X) and 
5 n X = we conclude that i? n M = 0. Since i? n X = and X is an assignment, 
5 C X. It follows that B+ C X. Consequently U n B+ = X D U D B+ = This 
contradicts the assumption that U is an unfounded set on M. □ 

Lemma 5 

For any program 11, any state Zi . . . Z„ reachable from in SMn, and any assign- 
ment X, if X+ is an answer set for 11 then X satisfies k if it satisfies all decision 
literals l^ with j < i. 

Proof 

By induction on the path from to /i . . . Z„. Recall that for any assignment X, if X+ 
is an answer set for 11, then X is a supported model of 11, and that the transition 
system SMn extends ATLEASTn only by the transition rule Unfounded. Given our 
proof of Lemma [U we only need to demonstrate that application of Unfounded 
preserves the property. 

Consider a transition M =^ M' justified by Unfounded, where M is a sequence 
Zi . . . Ik- M' is M^a, such that a & U , where U is an unfounded set on M w.r.t 11. 
Take any assignment X such that X+ is an answer set for 11 and X satisfies all 
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decision literals /j^ with j < k. By the inductive hypothesis, X \= M. Then X |= -la. 
Indeed, otherwise a would be a common element of X and U, and X D U would 
be non-empty, which contradicts Lemma |4l □ 

iProof of Proposition E] 

Parts (a) and (c) are proved as in the proof of Proposition [1] using Lemma [5] 
(b) As in the proof of Proposition[2jb) we conclude that M is a model of H. Assume 
that M+ is not an answer set. Then, by Lemma [3l there is a non-empty unfounded 
set U on M w.r.t. IT such that U C M. It follows that Unfounded is applicable 
(with an arbitrary a G f/). This contradicts the assumption that M is terminal. □ 

The fact that the assertion of Proposition|3]remains true if we drop the transition 
rules All Rules Cancelled, Backchain True, and Backchain False from the definition 
of SMn follows from the proof of Proposition|3] (b) that does not refer to those rules. 

4-2 Smodels Algorithm 

We can view a path in the graph SMn as a description of a process of search 
for an answer set for a program 11 by applying inference rules. Therefore, we can 
characterize the algorithm of an answer set solver that utilizes the inference rules of 
SMn by describing a strategy for choosing a path in SMn • A strategy can be based, 
in particular, on assigning priorities to some or all inference rules of SMn, so that 
a solver will never apply a transition rule in a state if a rule with higher priority is 
applicable to the same state. 

We use this method to describe the smodels algorithm. System smodels assigns 
priorities to the inference rules of SMn as follows: 

Backtrack, Fail 3> 

Unit Propagate LP, All Rules Cancelled, Backchain True, Backchain False ^ 

Unfounded 3> 

Decide. 

For example, let 11 be program ([8]). The smodels algorithm may follow a path 

=^ {Decide) 

( Unit Propagate LP) 
a^ c =^ {All Rules Cancelled) 

a^ c^b =^ {Unfounded) 
a^ c^b^d 

in the graph SMn, whereas it may never follow path (jlOp . because Unfounded has 
a higher priority than Decide. 

4-3 Tight Programs 

We will now review the definitions of a positive dependency graph and a tight 
program. The positive dependency graph of a program 11 is the directed graph G 
such that 
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• the nodes of G are the atoms occurring in 11, and 

• G contains the edges from a to hi {1 < i < I) for each rule 

a bi, . . . ,bi,not ...,not bm 

in n where a is an atom. 

A program is tight if its positive dependency graph is acyclic. For instance, pro- 
gram ([H) is not tight since its positive dependency graph has a cycle due to the rule 
-s— rf. On the other hand, the program constructed from ([8|) by removing this rule 
is tight. 

Recall that for any program 11 and any assignment M, if M+ is an answer set of 11 
then M is a supported model of H. For the case of tight programs, the converse 
holds also: M"*" is an answer set for 11 if and only if M is a supported model 
of n ( |Fages 19"94| or, in other words, is a model of the completion of 11. 

It turns out that for tight programs the graph SMn is "almost identical" to the 
graph DPp, where F is the clausified completion of 11. To make this claim precise, 
we need the following terminology. 

We say that an edge M M' in the graph SMn is singular if 

• the only transition rule justifying this edge is Unfounded, and 

• some edge M M" can be justified by a transition rule other than Un- 
founded or Decide. 

For instance, let 11 be the program 

a <— 6 
b ^ c. 

The edge 

a^b^^c^ =^ {Unfounded, U ^ {a, b}) 
a^ b^^c^^a 
in the graph SMn is singular, because the edge 

a^b^^c'^ =^ {All Rules Gancelled) 
a^ b^^c^^b 

belongs to SMn also. 

With respect to the actual SMODELS algorithm ([Simons 2000| . singular edges of 
the graph SMn are inessential: in view of priorities for choosing a path in SMn de- 
scribed in Section 1321 SMODELS never follows a singular edge. Indeed, the transition 
rule Unfounded has the lower priority than any other transition rule but Decide. 
By SM^ we denote the graph obtained from SMn by removing all singular edges. 

Proposition 5 

For any tight program 11, the graph SMjj is equal to each of the graphs ATLEASTn 
and DP cNF-Comp(ii)- 

For instance, let 11 be the program ([T3| . This program is tight, its completion is ([T4| . 
and \GNF — Gomp{I\) is formula p5|) . Proposition [5] asserts that, SMj^ coincides 
with oPcNF-CompCn) and with ATLEASTn- 
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From Proposition [SJ it follows that applying the SMODELS algorithm to a tight 
program essentially amounts to applying dpll to its completion. A similar relation- 
ship, in terms of pseudocode representations of SMODELS and dpll, is established 



in ( Giunchiglia and Maratea 2005 1 



In the rest of this section we give a proof of Proposition [5] 
Lemma 6 

For any tight program IT and any non-empty unfounded set C/ on a consistent set 
M of literals w.r.t. IT there is an atom a £ U such that for every B G Bodies{n, a), 
5nM ^ 0. 



Proof 

By contradiction. Assume that, for every a € U there exists B £ Bodies{Il, a) such 
that BnM — By the definition of an unfounded set it follows that for every atom 
a E U there is _B G Bodies {H, a) such that U DB^ ^ 0. Consequently the subgraph 
of the positive dependency graph of IT induced by U has no terminal nodes. Then, 
the program 11 is not tight. □ 

iProof of Proposition [5] 

In view of Proposition |31 it is sufficient to prove that SMj^ equals ATLEASTn; or, 
in other words, that every edge of SMn justified by the rule Unfounded only is 
singular. Consider such an edge M M' . We need to show that some transi- 
tion rule other than Unfounded or Decide is applicable to M . By the definition of 
Unfounded, M is consistent and there exists a non-empty set U unfounded on M 
w.r.t. n. By Lemma [HI it follows that there is an atom a £ U such that for every 
B G Bodies{Il, a), B Ci M $. Therefore, the transition rule All Rules Cancelled is 
applicable to M. □ 



5 Generate and Test 

In this section, we present a modification of the graph DP p (Section 12. ip that 
includes testing "partial" assignments of F found by dpll. 

Let be a CNF formula, and let G be a formula formed from atoms occur- 
ring in F. The terminal nodes of the graph GTp.a defined below are models of 
formula FAG. 

This modification of the graph dp^- is of interest, for example, in connection 
with the fact that answer sets of a program 11 can be characterized as models of 
its completion extended by so called loop formulas of 11 ()Lin and Zhao 2002[) . If 
iCNF — Co77ip(n), as above, is the completion converted to CNF, and LF(n) is 
the conjunction of all loop formulas of 11, then for any assignment M, M+ is an 
answer set of 11 iff M is a model of iCNF — Comp(n) A LF(n). Hence, the terminal 
nodes of the graph GTcNF-Comp{n),LF{n) will correspond to answer sets of 11. 

The nodes of the graph GTj? g are the same as the nodes of the graph DP p. The 
edges of GTp^o are described by the transition rules of DP^;' and the additional 
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transition rule: 

Test: 

{M is consistent, 
/ e M 

It is easy to see that the graph DPir is a subgraph of GTf,g- The latter graph 
can be used for deciding whether a formula FAG has a model by constructing a 
path from to a terminal node: 

Proposition 6 

For any CNF formula F and a formula G formed from atoms occurring in f , 

(a) graph GTf,g is finite and acyclic, 

(b) any terminal state of GTf,g other than FailState is a model of F A G, 

(c) FailState is reachable from in GTp.G if and only if f A G is unsatisfiable. 

Note that to verify the applicability of the new transition rule Test we need a pro- 
cedure for testing whether G entails a clause, but there is no need to explicitly write 
out G. This is important because LF{I\) can be very long (|Lin and Zhao 2002^ . 
For instance, let 11 be the nontight program 

Its completion is 

and \CNF - Comp{Ji) is 

(d V -.d). 

This program has one loop formula 

d ^ _L. 

Proposition [6] asserts that a terminal state -id of GT cNF-Comp{ii),d^i. is a model of 
\GNF - Gomp{n)hLF{IV). It follows that {-.d}+ = 0isan answer set of H. To com- 
pare with the graph 'O'P cNF-Comp(ny- state d is a terminal state in 'D'P cNF-Comp(n) 
whereas d is not a terminal state in GT cNF-Comp{n) , because the transition rule 
Test is applicable to this state. 

ASP-SAT with Backtracking ( [Giunchiglia et al. 2006[ ) is a procedure that com- 
putes models of the completion of the given program using dpll, and tests them 
until an answer set is found. The application of this procedure to a program 11 
can be viewed as constructing a path from to a terminal node in the graph 

cNF-Comp{n),LF(n) by adopting a strategy that Test is applied to a state M only 
when M is an assignment. 

In the rest of this section we give a proof of Proposition |6l 

Lemma 1 

For any CNF formula f , a formula G formed from atoms occurring in F, and a 
path from to a state l\ . . .1^ in GTp^Q^ any model X of F A G satisfies 4 if it 
satisfies all decision literals with j < i. 
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Proof 

By induction on the path from to /i . . . /„ . Similar to the proof of Lemma [TJ We 
will show that the property in question is preserved by the transition rule Test. 



Assume that X satisfies all decision literals from M. By the inductive hypothesis, 
X \= M. We will show that the rule justifying the transition from M to M' is 
different from Test. By contradiction. M' is Ml. By the definition of Test, G \= M. 
Since X is a model of -F A G it follows that X \= M. This contradicts the fact that 
X \^ M. □ 

iProof of Proposition [6] 

Part (a) and part (c) Right-to- left are proved as in the proof of Proposition [T] 
(b) Let M be any terminal state other than FailState. As in the proof of Proposi- 
tion[lJb) it follows that M is a model of F. The transition rule Test is not applicable. 
Hence G ^ M. In other words M is a model of G. We conclude that M is a model 



(c) Left-to-right: Since FailState is reachable from 0, there is a state M without 
decision literals such that M is reachable from and the transition rule Fail is 
applicable in M. Then, M is inconsistent. By Lemma [71 any model of F A G 
satisfies M. Since M is inconsistent we conclude F A G is unsatisfiable. □ 



Most modern SAT solvers implement such sophisticated techniques as backjumping 
and learning: 

Backjumping: Chronological Backtracking (used in classical dpll) can be 
seen as a prototype of Backjumping. Unlike Backtracking that undoes only the 
previously made decision, Backjumping is generally able to backtrack further 
in the search tree by undoing several decisions at once. 

Learning: Most modern SAT solvers implement so called conflict-driven 
backjumping and learning: whenever backjumping is performed they add 
(learn) a "backjump clause" to the clause database of a solver. Learning back- 
jump clauses prevents a solver from reaching "similar" inconsistent states. 

In this section we will extend the graph DPp to capture the ideas behind back- 
jumping and learning. The new graph will be closely related to the DPLL System 
with Learning graph introduced in (jNieuwenhuis et al. 2006| Section 2.4). 

We first note that the graph DPp is not adequate to capture such technique as 
learning since it is incapable to reflect a change in a state of computation related 
to newly learned clauses. We start by redefining a state so that it incorporates 
information about changes performed on a clause database. 

For a CNF formula F, an augmented state relative to F is either a distinguished 
state FailState or a pair M||r where M is a record relative to the set of atoms 
occurring in F, and F is a (multi-)set of clauses over atoms of F that are entailed 




of F A G 



6 Revievir: Abstract DPLL vi^ith Learning 



by F. 
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Unit Propagate A: 

C V Z e F U r and 



M||r =^ Mi\\r if 

Backjump: 

Pl^QWT =^ Pl'\\T if 
Learn: 

M\\r =4> M\\c, r if 



C CM 

Pl^ Q is inconsistent and 
F ^ I'WP 

every atom in C occurs in F and 
F[= C 



Fig. 3. The additional transition rules of the graph DPLp. 



We now define a graph dpl f for any CNF formula F. Its nodes are the augmented 
states relative to F. The transition rules Decide and Fail of BPp are extended to 
DPLf as follows: M||r =4> M'||r (M||r =^ FailState) is an edge in DPL^ justified 
by Decide (Fail) if and only if M =^ M' (M =^ FailState) is an edge in DPj? 
justified by Decide {Fail). Figure [3] presents the other transition rules of DVLp. We 
refer to the transition rules Unit Propagate A, Backjump^ Decide ^ and Fail of the 
graph DPLi? as Basic. We say that a node in the graph is semi-terminal if no rule 
other than Learn is applicable to it. 

We will omit the word "augmented" before "state" when this is clear from a 
context. 

The graph dpl p can be used for deciding the satisfiability of a formula F simply 
by constructing an arbitrary path from node 0||0 to a semi-terminal node: 

Proposition 7 

For any CNF formula F, 

(a) every path in DPLp contains only finitely many edges justified by Basic tran- 
sition rules, 

(b) for any semi-terminal state M||r of BPLp reachable from 0||0, M is a model 
of F, 

(c) FailState is reachable from 0||0 in BPLp if and only if F is unsatisfiable. 

On the one hand, Proposition [7] (a) asserts that if we construct a path from 0||0 so 
that Basic transition rules periodically appear in it then some semi-terminal state 
will be eventually reached. On the other hand, Proposition [7] (b) and (c) assert 
that as soon as a semi-terminal state is reached the problem of deciding whether 
formula F is satisfiable is solved. The proof of this proposition is similar to the 
proof of Theorem 2.12 from (jNieuwenhuis et al. 2006p . 
For instance, let F be the formula 

aV b 



-la V c. 
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Here is a path in DPLi?: 

0||0 => (Learn) 

0||6 V c =^ (Decide) 

-nb'^WbW c =^ ( Unit Propagate A) (17) 
-16^ c| I & V c ( Unit Propagate A) 

-.&^ca||6 V c 

Since the state -^b^ca is semi-terminal, Proposition [7] (b) asserts that c, a} is 
a model of F . 

Recall that the transition rule Backtrack of the graph dp^? - a prototype of Back- 
jump - is applicable in any inconsistent state with a decision literal in DVp. The 
transition rule Backjump, on the other hand, is applicable in any inconsistent state 
with a decision literal that is reachable from 0||0 (the proof of this statement is 
similar to the proof of Lemma 2.8 from (jNieuwenhuis et al. 2006[) ). The application 
of Backjump where is the last decision literal and V is 7 can be seen as an appli- 
cation of Backtrack. This fact shows that Backjump is essentially a generalization of 
Backtrack. The subgraph of DPi? induced by the nodes reachable from is basically 
a subgraph of dpl^. 



7 Answer Set Solver with Learning 

In this section we will extend the graph SMn to capture backjumping and learning. 
As a result we will be able to model the algorithms of systems SMODELScc and SUP. 



7.1 Graph SMLn 

An (augmented) state relative to a program 11 is either a distinguished state Fail- 
State or a pair of the form M||r where M is a record relative to the set of atoms 
occurring in 11, and F is a (multi-)set of constraints formed from atoms occurring 
in n that are entailed by 11. 

For any program 11, we will define a graph SMLn. Its nodes are the augmented 
states relative to 11. The transition rules Unit Propagate LP, All Rules Cancelled, 
Backchain True, Unfounded, Decide and Fail of SMn are extended to SMLn as 
follows: M||F =^ M'\\T (M\\T =^ FailState) is an edge in SMLn justified by a 
transition rule T if and only if M =^ M' (M =4> FailState) is an edge in SMn 
justified by T. Figure |4] presents the other transition rules of SMLn. 

We refer to the transition rules Unit Propagate LP, All Rules Cancelled, Backchain 
True, Backchain False A, Unfounded, Backjump LP, Decide, and Fail of the graph 
SMLn as Basic. We say that a node in the graph is semi-terminal if no rule other 
than Learn LP is applicable to it. 

The graph SMLn can be used for deciding whether a program Ft has an answer 
set by constructing a path from 0||0 to a semi-terminal node: 

Proposition 8 

For any program 11, 



Abstract Answer Set Solvers with Backjumping and Learning (long version) 23 



Backchain False A: 

r a^i,B e nur, 

M||r =^ M 1\\T a \ e M or a = _L, 
[ 5 C M 

Backjump LP: 

p,An\\r K Pl'Wr T / ^ <3 is inconsistent and 

=^ ||i n | jj entails Z'VP 

Learn LP: 

M\\T =^ M\\ ^ B,T if n entails 5 



Fig. 4. The additional transition rules of the graph SMLn- 

(a) every path in SMLn contains only finitely many edges labeled by Basic tran- 
sition rules, 

(b) for any semi-terminal state M||r of SMLn reachable from 0||0, is an 
answer set of 11, 

(c) FailState is reachable from 0||0 in SMLn if and only if 11 has no answer sets. 

Thus if we construct a path from 0||0 so that Basic transition rules periodically 
appear in it then some semi-terminal state will be eventually reached; as soon as a 
semi-terminal state is reached the problem of finding an answer set is solved. 

For instance, let 11 be program (jS]). Here is a path in SMLn with every edge 
annotated by the name of a transition rule that justifies the presence of this edge 
in the graph : 

0||0 =^ (Decide) 

a'^ 1 1 =^ ( Unit Propagate LP) 

a^c\\(/} =^ {All Rules Cancelled) 

a'^c^b\\(D =^ {Decide) 

a'^c^bd'^W^ => {Unfounded) ^ ' 

a'^c^bd'^^d\\9 =^ {Backjump LP) 

a'^c-.&-.(i||0 =^ {Learn LP) 
a'^ c^b^dW^a V V & V -^rf 

Since the state a'^ c^b^d is semi-terminal, Proposition [8] (b) asserts that 

{a, c, = {a, c} 

is an answer set for 11. 

Proof of Proposition [8] is in Section [TOl 

As in case of the graphs DPp and dplj?, Backjump LP is applicable in any 
inconsistent state with a decision literal that is reachable from 0||0 (Proposition [TT] 
from Section |9|), and is essentially a generalization of the transition rule Backtrack 
of the graph SMn. 

Modern SAT solvers often implement such sophisticated techniques as restart 
and forgetting in addition to backjumping and learning: 



24 



Y. Lierler 



Restart: A solver restarts the dpll procedure whenever the search is not 
making "enough" progress. The idea is that upon a restart a solver will explore 
a new part of the search space using the clauses that have been learned. 
Forgetting: This technique is usually implemented in relation with conflict- 
driven backjumping and learning. When a solver "notes" that earlier learned 
clauses are not helpful anymore it removes (forgets) them from the clause 
database. Forgetting allows a solver to avoid a possible exponential space 
blow-up introduced by learning. 

We may extend the graph SMLn with the following transition rules that capture 
the ideas behind these technique: 

Restart: 

M\\r => 0||r 

Forget LP: 

M\\ ^ B,r =^ M\\T. 

The transition rules Restart and Forget LP are similar to the analogous rules 
in ([Nieuwenhuis et al. 2006^ for extending dpll procedure with restart and for- 
getting techniques. It is easy to prove a result similar to Proposition |8] for the 
graph SMLn with Restart and Forget LP (for such graph a state is semi-terminal if 
no rule other than Learn LP, Restart, Forget LP is applicable to it.) 

7.2 Smodelscc and Sup Algorithms 

In Section [4. 2 1 we demonstrated a method for specifying the algorithm of an answer 
set solver by means of the graph SMn. In particular, we described the SMODELS al- 
gorithm by assigning priorities to transition rules of SMn. In this section we use this 
method to describe the SMODELScc ( |Ward and Schlipf 2004] ) and SUP (jLierler 2008^ 
algorithms by means of SMLn- 

System SMODELS cc enhances the SMODELS algorithm with conflict-driven back- 
jumping and learning. Its strategy for choosing a path in the graph SMLn is similar 
to that of SMODELS. System SMODELScc assigns priorities to inference rules of SMLn 
as follows: 

Backjump LP, Fail ^ 

Unit Propagate LP, All Rules Cancelled, Backchain True, Backchain False A ^ 

Unfounded ^ 

Decide. 

Also, SMODELScc always applies the transition rule Learn LP in a non-semi- 
terminal state reached by an application of Backjump LP, because it implements 
conflict-driven backjumping and learninglil In Section [11] we discuss details on 
which clause is being learned during the application of Learn LP. 

System SMODELScc (sup) also implements restarts and forgetting that may be modeled by the 
transition rules Restart and Forget LP. An application of these transition rules in SMLn relies 
on particular heuristics implemented by the solver. 
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In (IL ierler 2008^ , we introduced the simplified SUP algorithm that relies on back- 
tracking rather than conflict-driven backjumping and learning that are actually 
implemented in the system. We now present the SUP algorithm that takes these 
sophisticated techniques into account. 

System SUP assigns priorities to inference rules of SMLn as follows: 

Backjump LP, Fail ^ 

Unit Propagate LP, All Rules Cancelled, Backchain True, Backchain False X ^ 

Decide ^ 

Unfounded. 

Similarly to SMODELS cc, SUP always applies the transition rule Learn LP in a 
non-semi-terminal state reached by an application of Backjump LP. 

For example, let 11 be program ([8|). Path (fT8)) corresponds to an execution of 
system SUP, but does not correspond to any execution of SMODELS cc because for 
the latter Unfounded is a rule of higher priority than Decide. Here is another path 
in SMLn from 0||0 to the same semi-terminal node: 

0||0 =^ (Decide) 

a'^ 1 1 =^ { Unit Propagate LP) 

a^c\\9 =^ {All Rules Cancelled) (19) 

c - 6 1 1 =^ ( Unfounded) 
a^c^6^d||0 



Path (fT9|) corresponds to an execution of system SMODELS cc, but does not corre- 
spond to any execution of system SUP because for the latter Decide is a rule of 
higher priority than Unfounded. 

The strategy of sup of assigning the transition rule Unfounded the lowest priority 
may be reasonable for many problems. For instance, it is easy to see that transi- 
tion rule Unfounded is redundant for tight programs. The SUP algorithm is similar 
to SAT-based answer set solvers such as ASSAT (ILin and Zhao 2004| and CMOD- 
els ( [Giunchiglia et al. 2006[ ) (see Section [5?^ in the fact that it will first compute 
a supported model of a program and only then will test whether this model is 
indeed an answer set, i.e., whether Unfounded is applicable in this state. 



8 Generate and Test with Learning 

In this section we model backjumping and learning for the generate and test pro- 
cedure by defining a graph GTLp c that extends GTf,g (Section [5]) in a similar 
manner as DPLp (Section [6|) extends dp p. 



8.1 Graph GThp.G 

An (augmented ) state relative to a CNF formula F and a formula C formed from 
atoms occurring in F is either a distinguished state FailState or a pair of the 
form M||r, where M is a record (Section l2.ip relative to the set of atoms occurring 
in F, and P is a (multi-)set of clauses formed from atoms occurring in F that are 
entailed by A G. 
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The nodes of the graph GTLf g are the augmented states relative to a CNF 
formula F and a formula G formed from atoms occurring in F. The edges of GTLp^c 
are described by the transition rules Unit Propagate A, Decide, Fail of DPLp, the 
transition rules 

Backjump GT : 

Pl'^QWr =^ Pl'\\r if ^ ^^"^Q is inconsistent and 



F AG^l'\/ P 



Learn GT: 

M\\r =i> M||c, r if 



every atom in G occurs in F and 
FAG^G 

and the transition rule Test of GTj^^g that is extended to GTLj^.g as follows: 
M||r =^ M'\\r is an edge in GTLf,g justified by Test if and only if M =^ M' is 
an edge in GTp.G justified by Test. 

We refer to the transition rules Unit Propagate A, Test, Decide, Fail, Back- 
jump GT of the graph GTLi^^G as Basic. We say that a node in the graph is 
semi-terminal if no rule other than Learn GT is applicable to it. 

The graph GTLf.g can be used for deciding whether a formula FAG has a model 
by constructing a path from 0||0 to a terminal node: 

Proposition 9 

For any CNF formula F and a formula G formed from atoms occurring in F, 

(a) every path in GTLp^c contains only finitely many edges labeled by Basic 
transition rules, 

(b) for any semi-terminal state M||r of GTLp^c reachable from 0|I0, M is a model 
of FAG, 

(c) FailState is reachable from 0| |0 in G'TI^f,g if and only if F A G is unsatisfiable. 

As in case of the graph dplj?, the transition rule Backjump GT is applicable in 
any inconsistent state with a decision literal that is reachable from 0||0. We call 
such states backjump states. 

Proposition 10 

For any CNF formula F and a formula G formed from atoms occurring in F , the 
transition rule Backjump GT is applicable in any backjump state in GTL^-^g- 

Proofs of Propositions [9] and [10] are given in Section [131 



8.2 Cmodels Algorithm 

System cmodels implements an algorithm called ASP-SAT with Learning ( jCiunchiglia et al. 2006 ) 
that extends ASP-SAT with Backtracking by backjumping and learning. 

The application of cmodels to a program IT can be viewed as constructing a 
path from 0||0 to a terminal node in the graph GTL^^g, where 



• F is the completion of 11 converted to conjunctive normal form, and 

• G is LF{n). 
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In Sections 14.21 we demonstrated a method for specifying the algorithm of an 
answer set solver by means of the graph SMn- We use this method to describe the 
CMODELS algorithm using the graph GTL^^g. System CMODELS assigns priorities 
to the inference rules of GTLp o as follows: 

Backjump GT, Fail ^ 
Unit Propagate A S> 
Decide ^ 
Test. 

Also, CMODELS always applies the transition rule Learn GT in a non-semi- 
terminal state reached by an application of Backjump GT. 

The priorities imposed on the rules by CMODELS guarantee that the transition 
rule Test is applied to a model of FUF (clausificd completion F extended by learned 
clauses F). This allows CMODELS to proceed with its search in case if a found model 
is not an answer set. Furthermore, the CMODELS strategy guarantees that in a state 
reached by an application of Test, first Backjump GT will be applied and then in the 
resulting state Learn GT will be applied. The clause learned due to this application 
of Learn GT is derived by means of loop formulas (see ( [Giunchiglia et al. 2006 1). 



In this sense CMODELS uses loop formulas to guide its search. 

Systems SAG (|Lin et al. 2006^ and CLASP (jGebser et al. 2007P are answer set 
solvers that are enhancements of CMODELS. First, they compute and clausify pro- 
gram's completion and then use unit propagate on resulting propositional formula 
as an inference mechanism. Second, they guide their search by means of loop for- 
mulas. Third, they implement conflict-driven backjumping and learning. Also, SAG 
uses SAT solvers for search. The systems differ from CMODELS in the following: 

• they maintain the data structure representing an input logic program through 
out the whole computation, 

• in addition to implementing inference rules of the graph GTL^-^g they also 
implement the inference rule Unfounded of SMn. A hybrid graph combining 
the inference rule Unfounded of SMn and the inference rules of GTLp c may 
be used to describe the sag and CLASP algorithms. 

System SAG assigns the same priorities to the inference rules of the hybrid graph 
as CMODELS. Also, SAG at random decides whether to apply the inference rule 
Unfounded in a state. 

On the other hand, system CLASP assigns priorities to the inference rules of the 
hybrid graph as follows: 

Backjump GT, Fail > 

Unit Propagate A, Unfounded ^ 

Decide. 



Like CMODELS, both SAG and CLASP always apply the transition rule Learn GT 
in a non-semi-terminal state reached by an application of Backjump GT. 
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9 Backjumping and Extended Graph 



Recall the transition rule Backjump LP of SMLn 



Backjump LP: 



Pl^ 



Pl'\\T if 



{ 



PI Q is inconsistent and 
n entails V V P. 



A state in the graph SMLn is a backjump state if it is inconsistent, contains a 
decision literal, and is reachable from 0||0. Note that it may be not clear a priori 
whether Backjump LP is applicable to a backjump state and if so to which state the 
edge due to the application of Backjump LP leads. These questions are important 
if we want to base an algorithm on this framework. It turns out that Backjump LP 
is always applicable to a backjump state: 

Proposition 11 

For a program H, the transition rule Backjump LP is applicable to any backjump 
state in SMLn- 

Proposition 1111 guarantees that a backjump state in SMLn is never semi-terminal. In 
the end of this section we show how Proposition [Til can be derived from the results 
proved later in this paper. Next question to answer is how to continue choosing 
a path in the graph after reaching a backjump state. To answer this question we 
introduce the notions of reason and extended graph. 

For a program 11, we say that a clause Z V C is a reason for I to be in a list of 
literals PIQ w.r.t 11 if 11 entails / V C and C C P. We can equivalently restate 
the second condition of Backjump LP "H entails /' V P" as "there exists a reason 
for I' to be in PI' w.r.t. 11" (note that /' V P is a reason for I' to be in PI'). We 
call a reason for I' to be m PI' a backjump clause. Note that Proposition [Til asserts 
that a backjump clause always exists for a backjump state. It is clear that we may 
continue choosing a path in the graph after reaching a backjump state if we know 
how to compute a backjump clause for this state. We now define a graph SMlJj 
that shares many properties of SMLn but allows us to give a simpler procedure for 
computing a backjump clause. 

An extended record M relative to a program 11 is a list of literals over the set of 
atoms occurring in 11 where 

(i) each literal I in M is annotated either by A or by a reason for / to be in M 
w.r.t. n, 

(ii) M contains no repetitions, 

(iii) for any inconsistent prefix of M its last literal is annotated by a reason. 
For instance, let 11 be the program 



a ■<— not b 



c. 



The list of literals 
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is an extended record relative to 11. On the other hand, the hsts of hterals 

„A „A „A i^6V-ia lA lA „A i^-^bV-'a „A 

a -^a a — lO o o a c 

are not extended records. 

An extended state relative to a program 11 is either a distinguished state FailState 
or a pair of the form M||r where M is an extended record relative to 11, and F 
is the same as in the definition of an augmented state (i.e., F is a (multi-)set of 
constraints formed from atoms occurring in FE that are entailed by H.) It is easy to 
see that for any extended state S relative to a program FE, the result of removing 
annotations from all nondecision literals of 5" is a state of SMLn: we will denote this 
state by S^. 

For instance, consider program a not b. All pairs 

FailState 0||0 a^^6^*^-^'^||0 ^a^6^^°||0 

are among valid extended states relative to this program. The corresponding states S^ 
are 

FailState 0||0 a^-6||0 -a^6||0. 

We now define a graph SMl}^ for any program FT. Its nodes are the extended 

states relative to 11. The transition rules of SMLn are extended to SMlJj as follows: 

^ . . . . . .1 I 

Si 5*2 is an edge in SMl|j justified by a transition rule T if and only if Si 5*2 

is an edge in SMLn justified by T. 

We will omit the word "extended" before "record" and "state" when this is clear 

from a context. 

The following lemma formally states the relationship between nodes of the graphs 
SMLn and SMlJjI 

Lemma 8 

For any program 11, if S' is a state reachable from 0||0 in the graph SMLn then 
there is a state S in the graph SMlJj such that S^ ^ S' . 

The definitions of Basic transition rules and semi-terminal states in SMlJj are 
similar to their definitions for SMLn- 

Proposition \S 

For any program 11, 

(a) every path in SMlJj contains only finitely many edges labeled by Basic tran- 
sition rules, 

(b) for any semi-terminal state M||r of SMlJj, M+ is an answer set of 11, 

(c) SMlJj contains an edge leading to FailState if and only if 11 has no answer 
sets. 

Note that Proposition [8f (b), unlike Proposition [8] (b), is not limited to semi- 
terminal states that are reachable from 0||0. As in the case of the graph SMLn, 
SMlJj can be used for deciding whether a program 11 has an answer set. Further- 
more, the new graph provides the means for computing a backjump clause that 
permits practical application of the transition rule Backjump LP: Sections 110.31 
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and llll describe the Backjump Clause (Algorithm [1} and BackjumpClauseFirstUIP 
(Algorithm [2]) procedures that compute Decision and FirstUIP backjump clauses 
respectively. 

We say that a state in the graph SMlJj is a backjump state if its record is incon- 
sistent and contains a decision literal. Unlike the definition of a backjump state in 
SMLn, this definition does not require a backjump state to be reachable from 0||0 
in SMlJj. As in case of the graph SMLn, any backjump state in SMlJj is not semi- 
terminal: 

Proposition 

For a program 11, the transition rule Backjump LP is applicable to any backjump 
t 

state m SML^j. 

Proposition[8] (b), (c) and Proposition [11] easily follow from Lemma[8]and Propo- 
sition [8f (b), (c) and Proposition \ llf respectively. Proof of Proposition [8] (a) is 
similar to the proof of Proposition [Hf (a). 

Next section will present the proofs for Proposition [8f, Lemma [3 and Proposi- 
tion [TTf. It is interesting to note that the proofs of Lemma [8] and Proposition [TTf 
implicitly provide the means for choosing a path in the graph SMl|j: 

• given a state M||r and a transition rule Unit Propagate LP, All Rules Can- 
celled, Backchain True, Backchain False A, or Unfounded applicable to M||r, 
the proof of Lemma [8] describes a clause that may be used to construct a 
record M' so that there is an edge M||r M'||r due to this transition 
rule, 

• given a backjump state M||r, the proof of Proposition [TTf describes a back- 
jump clause that can be used to construct a record M' so that there is an 
edge M||r =^ M'||r due to Backjump LP. 

Furthermore, the construction of the proof of Proposition lllf paves the way for 
procedure Backjump Clause presented in Algorithm [TJ 

10 Proofs of Proposition [8f, Lemma [8], Proposition [TTf 
10.1 Proof of Proposition[Sf 

Lemma 9 

For any program H, an extended record M relative to H, and every assignment X 
such that is an answer set for H, if X satisfies all decision literals in M then 
X'^M. 

Proof 

By induction on the length of M . The property trivially holds for 0. We assume that 
the property holds for any state with n elements. Consider any state M with n-\-l 
elements. Let X be an assignment such that X^ is an answer set for H and X 
satisfies all decision literals in M . We will now show that X ^ M . 

Case 1. M has the form Pl^. By the inductive hypothesis, X ^ P. Since X 
satisfies all decision literals in M, X \= I. 
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Case 2. M has the form P/'^*^. By the inductive hypothesis, X |= P. By the 
definition of a reason, (i) 11 entails I V C and (ii) C C P. From (ii) it follows that 
P 1= -iC. Consequently, X |= -iC. From (i) it follows that for any assignment X 
such that X+ is an answer set, X |= / V C. Consequently, X \^ I. □ 

The proof of Proposition \8f assumes the correctness of Proposition llll that we 
demonstrate in Section flO. 31 
Proposition [2f 
For any program II, 

(a) every path in SMlJj contains only finitely many edges labeled by Basic tran- 
sition rules, 

(b) for any semi-terminal state MjlF of SMl]^, M+ is an answer set of II, 

(c) SMlJj contains an edge leading to FailState if and only if II has no answer 
sets. 

Proof 

(a) For any list N of literals by \N\ we denote the length of N . Any state M||r 
has the form MqI^ Mi . . . 1^ Mp\\T, where . . .1^ are all decision literals of M; 
we define a(M||r) as the sequence of nonnegative integers \Mq\, |Mi|, . . . , \Mp\, and 
a{FailState) = oo. For any states S and S' of SMl]^, we understand a{S) < a{S') 
as the lexicographical order. We first note that for any state M||r, value of a is 
based only on the first component M of the state. Second, there is a finite number 
of distinct values of a due to the fact that there is a finite number of distinct Ms 
over n. We conclude that there is a finite number of distinct values of a for the 
states of SMlJj, even though the number of distinct states in SMlJj is infinite. 

By the definition of the transition rules of SML^, if there is an edge from M||r 
to M'||r' in SMlJj formed by any Basic transition rule then Q;(M||r) < Q;(M'||r'). 
Then, due to the fact that there is a finite number of distinct values of a, it follows 
that there is only a finite number of edges due to the application of Basic rules 
possible in any path. 

(b) Let M||r be a semi-terminal state so that none of the Basic rules are applicable. 
From the fact that Decide is not applicable, we conclude that M assigns all literals. 

Furthermore, M is consistent. Indeed, assume that M is inconsistent. Then, since 
Fail is not applicable, M contains a decision literal. Consequently, M||r is a back- 
jump state. By Proposition llll . the transition rule Backjump LP is applicable 
in M||r. This contradicts our assumption that M||r is semi-terminal. 

Also, M is a model of II: since Unit Propagate LP is not applicable in M||r, it 
follows that for every rule a<— iJGlI, if5CM then a £ M. 

Assume that M+ is not an answer set. Then, by Lemma[3l there is a non-empty 
unfounded set U on M w.r.t. II such that U C M. It follows that Unfounded is 
applicable (with an arbitrary a € U) in M||r. This contradicts the assumption 
that M||r is semi-terminal. 

(c) Left-to-right: There is a state M||r in SMlJj such that there is an edge be- 
tween M| |r and FailState. By the definition of SMlJ[, this edge is due to the transi- 
tion rule Fail. Consequently, state M||r is such that M is inconsistent and contains 
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no decision literals. By Lemma [9l for every assignment X such that is an an- 
swer set for n, X satisfies M. Since M is inconsistent we conclude that 11 has no 
answer sets. 

Right-to-left: Consider the process of constructing a path consisting only of edges 
due to Basic transition rules. By (a), it follows that this path will eventually reach 
a semi-terminal state. By (b), this semi-terminal state cannot be different from 
FailState, because 11 has no answer sets. We conclude that there is an edge leading 
to FailState. □ 

10.2 Proof of Lemma\E 

The proof uses the notion of loop formula (|Lin and Zhao 2004[) . 

Given a set A of atoms by Bodies (11, A) we denote the set that consists of the 
elements of Bodies{Il, a) for all a in A. Let H be a program. For any set Y of atoms, 
the external support formula (|Lee 2005^ for Y is 

V (20) 

B£Bodies{n,Y),B + nY=<D 

We will denote the external support formula by ESu,y- For any set Y of atoms, 
the loop formula for Y is the implication 

V a^ESn,Y. 

aeY 

We can rewrite this formula as the disjunction 

f\ -^aWESn^Y- (21) 

aeY 

From the Main Theorem in (|Lee 2005P we conclude: 
Lemma on Loop Formulas 

For any program H, H entails loop formulas ([21]) for all sets Y of atoms that occur 
in n. 

For a state S in the graph SMlJj, we say that in SMLn is the image of S. 
Lemma\^ 

For any program H, if S' is a state reachable from 0||0 in the graph SMLn then 
there is a state S in the graph SMlJj such that 5'-'' = S' . 

Proof 

Since the property trivially holds for the initial state 0[|0, we only need to prove 
that all transition rules of SMLn preserve it. 

Consider an edge M||r =^ M'\\T' in the graph SMLn such that there is a state 
Mi||r in the graph SMlJj satisfying the condition (Mi||r)-'- — M\\V. We need to 
show that there is a state in the graph SMlJ^ such that M'||r' is its image in SMLn. 
Consider several cases that correspond to a transition rule leading from M||r to 
M'llF': 
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Unit Propagate LP: 

M\\T =^ M fflllr if 



a ^ _S e n and 
B d M. 



M'\\r' is Ma||r. It is sufficient to prove tfiat Mia°-'^^\\r is a state of SMlJj. It is 
enough to show that a clause a V 5 is a reason for a to be in Ma. By apphcabihty 
conditions of Unit Propagate LP, B C M. Since 11 entails its rule a B,Il entails 
a V 5. 

All Rules Cancelled: 

M\\T =^ M -^a\\r if S n M 7^ for aU B e Bodies{U, a). 

M'\\r' is M^a\\r. Consider any B e Bodies(n, a). Since 5 n M ^9, B contains a 
literal from M: call it f{B). It is sufficient to show that 

-aV V f{B) (22) 

BeBodies(n,a) 

is a reason for -la to be in M^a. 

First, by the choice of f{B), f{B) G M; consequently, 



V /(^) ^ ^- 

BeBodies{n,a) 

Second, since f{B) G B, the loop formula -^a V ESu,{a} entails ([22l) . By Lemma 
on Loop Formulas, it follows that 11 entails (P^ . 
Backchain True: 

B eU, 

Mlir =^ MZlir if <! ^ _^ ^ ^, ^ ^^^^g^^^jj^ 1^1^ 

M'||r' is Ml\\r. Consider any B' e Bodies(n, a)\B. Since s'nM ^ 0, B' contains 
a literal from M: call it f{B'). A clause 

ZV-aV Y f{B')- (23) 

B'eSotiies(n,a)\S 

is a reason for Z to be in ML The proof of this statement is similar to the case of 
All Rules Cancelled. 
Backchain False A: 

a^i,B e nur, 

M||r =^ M 7||r ii { G M or a = ±, 
B C M. 

M'\\T' is M I||r. A clause IvBV a is a reason for 7 to be in Ml. The proof of this 
statement is similar to the case of Unit Propagate LP. 
Unfounded: 



M\\T =^ M^aWT if 



M is consistent and 

a G [/ for a set U unfounded on M w.r.t. 11. 
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M'\\T' is M -.a||r. Consider any B e Bodies{n, U) such that [/ n 5+ = 0. By the 
definition of an unfounded set, it foUows that EDM ^ ^. Consequently, B contains 
a hteral from M: caU it f{B). The clause 

-«V V f{B) (24) 

Bodies(n,C/),B + nf/=0 

is a reason for -la to be in M^a. The proof of this statement is similar to the case 
of All Rules Cancelled. 

Backjump LP, Decide, Fail, and Learn LP: obvious. □ 

The process of turning a state of SMLn reachable from 0||0 into a corresponding 
state of SMlJj can be illustrated by the following example: Consider a program 11 

a not b 

b <— not a, not c 

c ^ not f 

^k,d (25) 

k <^ I, not h 

<— m, not I, not h 

m -f- not k, not I 

and a path in SMLn 

0||0 =^ (Decide) 
a^||0 =^ {All Rules Cancelled) 
a^^6||0 ^ {Decide) 
a'^-i6c'^||0 =^ {Backchain True) 
a^^bc^^f\\%==^ {Decide) 

a^^bc^^f d^W^I) {Backchain False X) ^ ' 

a^^bc^^f d^^k\\^ ^ {Backchain False A) 
a^^bc^^f d^^k^l\\% =^ {Backchain False A) 
a^^bc^^f d^^k^l^m\\% =^ {Unit Propagate LP) 
a^^bc^^f d^^k^l^mm\\% 

The construction in the proof of Lemma [S] applied to the nodes in this path gives 
following states of SMlJ^: 

0||0 
0-^110 

^A^^.6V^a^A^y./V.c^A||0 
^A^^^f,V^a^A^y^/V^c^A^^-.feV-.d||0 
^A^^^f,V^a^A^y^/V^c^A^^-.fcV-.d^piVbVfc||0 
„A^^^f,V-,a^A^y-./V^c^A^^-.fcV^d^piVbVfc. 
^A^^^f,V^a^A^y^/V^c^A^^^feV^d^piVfeVfc. 

It is clear that these nodes form a path in SMlJj with every edge justified by the 
same transition rule as the corresponding edge in path (p6|) in SMLn- 
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10.3 Proof of Proposition 1 1 it 

In this section 11 is an arbitrary and fixed logic program. 

For a record M, by lcp{M) we denote its largest consistent prefix. We say that a 
clause C is conEicting on a list M of literals if 11 entails C, and C C lcp{M). For 
example, let M be the first component of the last state in (|27)) : 

Then, lcp{M) is obtained by dropping the last element m™^^'^^ of M . It is clear 
that the reason m V A; V / for m to be in M is a conflicting clause on M . 

Lemma 10 

The literal that immediately follows lcp{M) in an inconsistent record M, has the 
form l'-^ where C is a conflicting clause on M . 



Proof 

By the requirement (iii) of the definition of an extended record, the literal that 
immediately follows lcp{M) may not be annotated by A. Consequently, the literal 
has the form . We now show that C is a conflicting clause on M. Since C is a 
reason for I to be in lcp{M)l'^ , it immediately follows that 11 entails C, C can be 
written as IVC , and C" C lcp{M). Since I immediately follows the largest consistent 
prefix of M, I £ lcp{M). Consequently, C C lcp{M). We conclude that C is indeed 
a conflicting clause on M. □ 

For any inconsistent record h ■ ■ • In and any conflicting clause C on this record, 
by /3i-^...i^{C) we denote the set of numbers i such that k G C. (It is clear that 
every element from C equals to one of the literals in • ■ • Z„.) The relation I < J 
between subsets I , J of {1 ■ ■ ■ n} is understood here as the lexicographical order 
between / and J sorted in descending order. For instance, {2 6 7} < {6 7 8} 
because {762}<{876}in lexicographical order. 

Recall that the resolution rule can be applied to clauses C V / and C" V -^l and 
produces the clause C V C , called the resolvent of C V Z and C V on I. 

Lemma 11 

Let M be a record and let /■^ be a nondecision literal from lcp{M). If clause D is 
the resolvent of B and a clause C conflicting on M then 

(i) D is a clause conflicting on M, 

(ii) Pm{D) < I3m{C). 

For instance, let M be ([28|) . let reason -im V Z V & for -im in lcp{M) be 5, and let 
conflicting clause m V fc V Z on M be C . Then £>, the result of resolving B together 
with C, is clause kW IV b. Lemma [TT] asserts that fc V Z V 6 is a conflicting clause 
on M and that /3m(-D) < /3m(C). Indeed, /3m(£') = {2 6 7} and /3m(C) = {6 7 8}. 
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Proof 

(i) Clause D is a resolvent of B and C on some literal /'. Then, for some literal 
I' e B,l/ e C. The clause C can be written as F V C . 

In order to demonstrate that D is a conflicting clause we need to show that 
iJ C lcp{M) and 11 entails D. 

Since 5 is a reason for I to be in lcp{M), H entails B and B has the form / V B' 
where B' C lcp{M). Since C is a conflicting clause on M, C C lcp{M) and 11 
entails C . From the fact that lcp{M) is consistent, it follows that there is no literal 
in B' such that its complement occurs in C . Consequently, /' ^ B' so that V is I 
and D is B'W C . We conclude that D C lcp{M). From the fact that U entails B, U 
entails C, and the construction oi D, it follows that 11 entails D. 

(ii) From the proof of (i) it follows that is a resolvent of B and C on / 
where B has the form I V B'. Since 5 is a reason for I to be in lcp{M), every 
literal in B' precedes I in lcp{M). Since D is derived by replacing I in C with B', 
Pm{D)<Pm{B). □ 

Let record M be h ■ ■ ■ k ■ ■ ■ In, the decision level of a literal is the number of 
decision literals in /i • • • Z^: we denote it by decMik)- We will also use this notation 
to denote the decision level of a set of literals: For a set P C M of literals, decM{P) 
is the decision level of the literal in P that occurs latest in M. For record M and a 
decision level j by we denote the prefix of M that consists of the literals in M 
that belong to decision level less than j and by M^l we denote the prefix of M that 
consists of the literals in M that belong to decision level less than or equal to j. 
For instance, let M be record (|28p then decM{~^k) = 3, decMi^b c -ifc) — 3, is 
^A^^-fcv-a^A^y-/v-c^ and M^l is M itself. 

Lemma 12 

For an inconsistent record M and a confiicting clause / V C on M, if decMi}) > 
decM{c) for all c e C then lcp{MY'"'"^'^^U^'^^ is a record. 

Proof 

We need to show that (i) I ^ IcpiMY""'-'^'^^ and (ii) ? V C is a reason for I to be 
in /, i.e, H entails Z V C and C C fcp(M)'^'='=("^)l. 

Since Z V C is conflicting on M, Z V C C lcp(M). From the consistency of 
lcp{M) and the fact that I G lcp{M), it follows that / ^ lcp{M). Consequently, 
Z ^ lcp{MY<"'(^^\ 

Since /V C is conflicting on M, H entails /V C and IW C C lcp{M). Consequently, 
C C lcp{M). From the definition of decmiC), it follows that decM{C) is the decision 
level of the literal in C that occurs latest in lcp{M). By the definition of a decision 
level, IJ C /cp(M)'^'='="("^)l . □ 

Proposition Mlj 

For a program H, the transition rule Backjump LP is applicable to any backjump 
t 

state m SML|t. 
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Proof 

Let M| |r be a backjump state in SMlJj. Let R be the list of reasons that are assigned 
to the nondecision hterals in lcp{M). 

Consider the process of building a sequence Ci , C2 , . . . of clauses so that 

• Ci is the reason of the member of M that immediately follows lcp{M), and 

• Cj (j > 1) is a resolvent of Q-i and some clause in R 

while derivation of new clauses is possible. From Lemma [TT] (i) and the choice of Ci 
and R, it follows that any clause in Ci, C2 . . . is conflicting. By Lemma [11] (ii) 
we conclude that /3m(Q) < /3m(C,-i) > 1)- It is clear that this process will 
terminate after deriving some clause Cm, since the number of conflicting clauses 
on M is finite. It is clear that clause Cm cannot be resolved against any clause in R. 

Case 1. Cm is the empty clause. Since M||r is a backjump state, M contains a 
decision literal l^. By part (iii) of the definition of a record, I belongs to lcp{M). 
Consequently, M can be represented in the form kp(M)'^'^'^*''^') l^ Q. 

By the choice of Ci, Ci is a reason and must consist of at least one literal. 
Consequently, m > 1. Clause C,„ is derived from clauses C„i-i and some clause 
in R. Since Cm is empty, Cm-i is a unit clause /'. We will show that 

IcpiMY"""^^^ l^ Q\\T =^ IcpiMY"'"-'^^^ f'\\T 

is an application oi Backjump LP. It is sufficient to demonstrate that lcp{M)'^'^'^"^^'> I'' 
is a record. Since lcp{M)'^'^'^"'-^'> l^ Q is a record, we only need to show that I' ^ 
Icp^MY"""^^'^ and clause I' is a reason for I' to be in IcpiMY"'""^'-'' I' . RecaU that 
Cm-i, i-e., I', is a conflicting clause. Consequently, 11 entails I' and I' E lcp{M). 
Since lcp{M) is consistent, I' ^ lcp{M) so that I' ^ lcp{MY'""''^^K On the other 
hand, from the fact that 11 entails I' it immediately follows that clause I' is a reason 
for I' to be in /cp(M) I'. 

Case 2. Cm is not empty. Since Cm is a conflicting clause on M, the complement 
of any literal in Cm belongs to lcp{M). Furthermore, every such complement is a 

decision literal in lcp{M). Indeed, if this complement is l'^^ G lcp{M) then I V S 
is one of the clauses Bi , and it can be resolved against Cm ■ 

By the deflnition of a decision level, there is at most one decision literal that 
belongs to any decision level. It follows that Cm can be written as I V Cm so 
that decM(l) > decM^c) for any c € C!^. Consequently, M can be written as 
lcp{My'"'«C^U'^ Q. Note that 

icpiMY^^^'C^U^ Q\\r =^ icp{MY'"'"'^'^'^U^'-\\r 

is an application of Backjump LP. Indeed, by Lemma fT2l /cpfM)'^'^'^^^'^'^'"^! Z*^'" is a 
record. □ 

Algorithm [T] presents procedure Backjump Clause that computes a backjump 
clause for any backjump state in the graph SMlJj. The algorithm follows from the 
construction of the proof of Proposition [TTf. It is based on the iterative application 
of the resolution rule on reasons of the smallest inconsistent prefix of a state. The 
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BackjumpClause (M||r); 

Arguments : M||r is a backjump state in SMlJj 

Return Value : C is a backjump clause 

begin 

C the reason of the member of M that immediately follows lcp{M); 
N the list of the nondecision literals in lcp{M); 
R the list of the reasons that are assigned to the literals in A''; 
while Cn do 

I <r- a literal in "C n TV; 
B the clause in R that contains I; 
C' the resolvent of C and B on I; 
if C" = then 
L return C 
C ^ C 
return C ; 

Algorithm 1: A procedure for generating a backjump clause. 



proof of Proposition ! 1 if allows to conclude the termination of BackjumpClause and 
asserts that a clause returned by the procedure is a backjump clause on a backjump 
state. 

For instance, let 11 be (j25p . Consider an execution of BackjumpClause on 11 and 
backjump state (|28l). The table below gives the values of lcp{M), C, N, and R 
during the execution of the BackjumpClause algorithm. By Ci we denote a value 
of C before the z-th iteration of the while loop. 

mV kV I 



lcp{M) 
Ci 



N 
R 



.f^-'bV^a ^f^fW^c -^f^^kV^d ^ 



-■ivbvk^^- 

■^ly by k. 



imVlVb 

-^mV IV b 



(29) 



C2 

C5 



kVlVbis the resolvent of Ci and -im V Z V 6 

V 6 is the resolvent of C2 and -1/ V bVk 
-id V 6 is the resolvent of C3 and ^ky 
-id V is the resolvent of C4 and -i& V -la 



The algorithm will terminate with the clause -id V ^a. Proof of Proposition [TTf 
asserts that (i) this clause is a backjump clause such that d and a are decision 
literals in M and (ii) the transition 

^A^^-.bV^a ^A^pfy^c ^A^j^^kV^d^l-^l^b^k. 



(30) 



in SML'I'n is an application of Backjump LP. Indeed, by Lemma[T2]/cp(M)'^'^'^*'(""'l ^d"''^"", 
in other words a^-i&^''^^''-id^'^^^'', is a record. 

Note that a backjump clause may be derived in other ways than captured by 
BackjumpClause algorithm: the transition rule Backjump LP is applicable with 
an arbitrary backjump clause. Usually, DPLL-like procedures implement conflict- 
driven backjumping and learning where a particular learning schema such as, for 
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instance, Decision or FirstUIP ([Mitchell 2005P is applied for computing a special 
kind of a backjump clause. It turns out that the Backjump Clause algorithm captures 
the Decision learning schema for ASP. Typically, SAT solvers impose an order for 
resolving the literals during the process of Decision backjump clause derivation. We 
can impose similar order by replacing the line 

/ ^ a literal inV n N 

in the algorithm Backjump Clause with 

I ■(— a literal in C n that occurs latest in lcp{M). 

In fact, the sample application of Backjump Clause algorithm described in (j29|) 
follows this ordering. 

This section introduced Backjump Clause algorithm that derives a Decision back- 
jump clause for an arbitrary backjump state. In the next section we will introduce 
an algorithm that will compute an ASP counterpart of FirstUIP backjump clause. 



11 FirstUIP Conflict-Driven Backjumping and Learning 

Conflict-driven backjumping and learning proved to be a highly successful tech- 
nique in modern SAT solving. Furthermore, in ( [Zhang et al. 200T| ) the authors 
investigated the performance of various learning schemes and established exper- 
imentally that FirstUIP clause is the most useful single clause to learn. Success of 
conflict-driven learning led to the implementation of its ASP counterpart in systems 
SMODELScc, CLASP, and SUP. There are two common methods for describing a back- 
jump clause construction in the SAT literature. The first one employes the implica- 
tion graph fMarq ues-Silva and Saka llah 1996[) and the second one employes resolu- 
tion ([Mtchcll 200l|. Ward and Schlipf (Ward and Schlipf 2004[) extended the defi- 



nition of an implication graph to the SMODELS algorithm and implemented FirstUIP 
learning schema in answer set solver SMODELS cc- In the previous section we used 
SMlJj formalism and resolution to describe the Backjump Clause algorithm for com- 
puting an ASP counterpart of a Decision backjump clause. In ([Gebser et al. 2007|) 
the authors used the concepts from constraint processing to implement FirstUIP 
learning schema in answer set solver CLASP. 

This section presents the BackjumpClauseFirstUIP algorithm for computing an 
ASP counterpart of a FirstUIP backjump clause by means of SMlJj formalism and 
resolution. The BackjumpClauseFirstUIP algorithm is employed by the system SUP 
in its implementation of conflict-driven backjumping and learning. 

The Algorithm [2| presents procedure BackjumpClauseFirstUIP that computes a 
FirstUIP backjump clause for any backjump state in the graph SMlJj. 

We now state the correctness of the algorithm BackjumpClauseFirstUIP . We 
start by showing its termination. By Ci we will denote the initial value assigned to 
clause C. From Lemma (Til (i) and the choice of Ci we conclude that at any point of 
computation clause C is conflicting on M. By Lemma [TT] (ii). the value of I3m{C) 
decreases with each new assignment of clause C in the while loop. It follows that 
the while loop will terminate since the number of conflicting clauses C on M such 
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BackjumpClauseFirstUIP (M||r); 

Arguments : M||r is a backjump state in SMlJj 

Return Value : C is a backjump clause 

begin 

C the reason of the member of M that immediately follows lcp{M); 
I -f- the literal in C that occurs latest in lcp{M); 

P •<— the sublist of lcp{M) that consists of the literals that belong to the decision 
level dec{l); 

R ^r- the list of the reasons that are assigned to the literals in P; 
while |CnP| > 1 do 

I <— the literal in C that occurs latest in P; 

B -f- the clause in R that contains I; 

C <— the resolvent of C and B on I; 

return C ; 

Algorithm 2: A procedure for generating a FirstUIP backjump clause. 



that \C D P\ > 1 is finite. By Cm we will denote the clause C with which the 
while loop terminates. In other words BackjumpClauseFirstUIP returns Cm- We 
now show that Cm is indeed a backjump clause. We already concluded that C,„ 
is a conflicting clause on M. Furthermore, from the termination condition of the 
while loop I Cm H P | < 1 . From the choice of Ci and P it follows that | Cm H P | = 1 . 
Consequently, C„j can be written as / V Cm where I is in singleton Cm H P. By 
Lemma [TT] (ii), /3{Cm) < /3(Ci)- From the definition of /? and the choice of P it 
follows that rfecM(T) > decM(c) for all c e C^,. By LemmaHa IcpiMY"""'-'^^^ l'^- 
is a record. In other words, transition 

M||r =^ lcp{My''''"(^'^h'^"'\\T 

is an application of Backjump LP. Consequently, Cm is a backjump clause. 

For instance, let 11 be (1251) . Consider an execution of BackjumpClauseFirstUIP 
on n and a backjump state (1^51) . The table below gives the values of lcp{M), C, P, 
and R during the execution of BackjumpClauseFirstUIP . By Ci we denote a value 
of C before the i-th iteration of the while loop. 

IcpiM) a^^h'^^'^-^'' c^^pf'^^'^d^-.^ 
Ci mV kV I 

R -^ky ^d, ^ly by k. ^my ly b 



C2 
C3 



ky ly b is the resolvent of Ci and V / V 6 
ky b is the resolvent of C2 and -^ly by k. 



The BackjumpClauseFirstUIP algorithm will terminate with the clause ky b. The 
proof of the correctness of BackjumpClauseFirstUIP asserts that fcV & is a backjump 
clause and the transition 



(31) 
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in SML^n is an application of Backjump LP. 

12 Extended Graph: Generate and Test 

In this section we introduce an extended graph GTl]^ ^ for the generate and test 
abstract framework GTLp^o similar as in Section [9] we introduced SMlJj for SMLn- 

For a formula H, we say that a clause Z V C is a reason for I to be in a list PIQ 
of literals w.r.t. H if H \= I V C andV C P. 

An (extended) record M relative to a formula _ff is a list of literals over the set 
of atoms occurring in H where 

(i) each literal I in M is annotated either by A or by a reason for I to be in M 
w.r.t. H, 

(ii) M contains no repetitions, 

(iii) for any inconsistent prefix of M its last literal is annotated by a reason. 

An (extended) state relative to a CNF formula F, and a formula G formed from 
atoms occurring in F is either a distinguished state FailState or a pair of the 
form M||r, where M is an extended record relative to A G, and F is the same 
as in the definition of an augmented state (i.e., F is a (multi-)set of clauses formed 
from atoms occurring in F that are entailed by f A G.) For any extended state S 
relative to F and G, the result of removing annotations from all nondecision literals 
of 5* is a state of GTLp^c- we will denote this state by S-^. 

For a CNF formula F and a formula G formed from atoms occurring in F, we 
will define a graph GTL^ q . The set of the nodes of GTl]^ ^ consists of the extended 
states relative to F and G. The transition rules of GTL^-.g are extended to GTl]^ q 
as follows: =^ S2 is an edge in GTl]^, ^ justified by a transition rule T if and 
only if Si =^ S2 is an edge in GTL^^g justified by T. 

The lemma below formally states the relationship between nodes of the graphs 
GTLf^g and GTh'p q-. 

Lemma 13 

For any CNF formula F and a formula G formed from atoms occurring in F /li S' 
is a state reachable from 0||0 in the graph GTLf.g then there is a state S in the 
graph GTL^ Q such that S^ = S' . 

The definitions of Basic transition rules and semi-terminal states in GTl]^ q are 
similar to their definitions for GTI^f,g ■ 
Proposition [Pf 

For any CNF formula F and a formula G formed from atoms occurring in f , 

(a) every path in GTl]^ q contains only finitely many edges labeled by Basic 
transition rules, 

(b) for any semi-terminal state MjjF of GTl]^ M is a model of FAG, 

(c) GTl]^ q contains an edge leading to FailState if and only if F A G is unsatis- 
fiable. 



42 



Y. Lierler 



We say that a state in the graph GTLp g, is a backjump state if its record is 
inconsistent and contains a decision hteral. As in case of the graph GTL^^g, any 
backjump state in GTLp q is not semi-terminah 

Proposition \10i 

For any CNF formula F and a formula G formed from atoms occurring in F, the 
transition rule Backjump GT is applicable in any backjump state in GThp q. 

Proposition|n](b), (c) and Proposition[TU]easily follow from Lemma [T^ and Propo- 
sition [3f (b) , (c) and Proposition I10| l respectively. Proof of Proposition [5] (a) is 
similar to the proof of Proposition [3f (a). 

13 Proofs of Proposition [of , Lemma 1131 Proposition [Tof 
13.1 Proof of Proposition \S 

Lemma 14 

For any CNF formula F, a formula G formed from atoms occurring in F, an ex- 
tended record M relative to A G, and any model X of A G, if X satisfies all 
decision literals in M then X \^ M. 

Proof 

By induction on the length of M. The property trivially holds for 0. We assume that 
the property holds for any state with n elements. Consider any state M with n + 1 
elements. Let X be a model of F A G such that X satisfies all decision literals in M. 

Case 1. M has the form Pl"^. By the inductive hypothesis, X ^ P. Since X 
satisfies all decision literals in M, X \= l^. 

Case 2. M has the form Pl^^'-^ . By the inductive hypothesis, X ^ P. By the 
definition of a reason (i) FAG entails I V G and (ii) G 'Z P. From (ii) it follows 
that P 1= -.G. Consequently, X ^ -.G. From (i) it follows that X \= IV C. We 
conclude that X \= I. □ 

The proof of Proposition [9f assumes the correctness of Proposition II Ol that we 
demonstrate in Section [13.31 

iProof of Proposition inf 

Parts (a) and (c) are proved as in the proof of Proposition [8f, using Lemma [T4l 
(b) Let M| |r be a semi-terminal state so that none of the Basic rules are applicable. 
From the fact that Decide is not applicable, we conclude that M assigns all literals. 

Furthermore, M is consistent. Indeed, assume that M is inconsistent. Then, since 
Fail is not applicable, M contains a decision literal. Consequently, M||r is a back- 
jump state. By Proposition [TOf, the transition rule Backjump GT is applicable in 
M||r. This contradicts our assumption that M||r is semi-terminal. 

Also, M is a model of F: since Unit Propagate A is not applicable, it follows that 
for every clause GVZeFUFifGCM then I e M. Consequently, M \= C V I. 
Furthermore, M is a model of G: since Test is not applicable, then G ^ M. We 
conclude that M \= G. Consequently, M is a model of A G. □ 
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13.2 Proof of Lemma\M 

For a state S in the graph GTl]^ g,, we say that S^ in Gihp.G is the image of 5*. 
Lemma\T^ 

For any CNF formula F and a formula G formed from atoms occurring in F, if 5" 
is a state reachable from 0||0 in the graph GThp.G then there is a state S in the 
graph GTL'p Q such that S^ = S' . 

Proof 

Since the property trivially holds for the initial state 0||0, we only need to prove 
that all transition rules of Gt:i^f,g preserve it. 

Consider an edge M||r M'||r' in the graph GThF,G such that there is a 
state Mi||r in the graph GTl]^ ^, satisfying the condition (Mi||r)^ = M||r. We 
need to show that there is a state in the graph GTi^p q such that M'l|r' is its 
image in GTh f^g- Consider several cases that correspond to a transition rule leading 
from Mllr to M'llr': 



Unit Propagate A: 

MIIF Ml\\V if 



C V Z e F U F and 
C C M. 



M'\\T' is Ml\\T. It is sufficient to prove that Mil^'^^\\T is a state of GTl], ^. It is 
enough to show that a clause C V Z is a reason for Z to be in w.r.t. FAG, i.e, 
FAG \= CVl and C C M. By apphcability conditions of Unit Propagate X, C C M. 
By the definition of a state FAG entails F. Since G V I e F nV, F A G \^ C V I. 
Test: 

M is consistent, 
M||F => Ml\\T a { G\=W, 
I G M. 

M'\\r' is M7||F. It is sufficient to prove that MJ'^ \\T is a state of GTl], p. M has 
the form I V C. It is enough to show that a clause 7 V C is a reason for I to be in 
Ml w.r.t. F A G. It is trivial that G C M. By applicability condition of the rule, 
G h^V G. 

Backjump GT, Decide, Fail, and Learn GT: obvious. □ 



13.3 Proof of Proposition [7of 

For a state M/'^||F, we say that a reason G is a backjump clause if there is a 
transition Backjump GT leading to MZ'^||F in GTLf.g- 

In this section F is an arbitrary and fixed CNF formula and G is an arbitrary 
and fixed formula formed from atoms occurring in F. 

For a record M, by lcp{M) we denote its largest consistent prefix. We say that a 
clause G is conflicting on a list M of literals if F A G entails G, and G C lcp{M). 

Lemmas [TOl [Tl] [T2l hold for the case of extended record relative to a formula. The 
proofs of the lemmas have to be modified only by replacing 11 with FAG. 

Proposition llOf is proved as Proposition II if. 
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Algorithms Backjump Clause and BackjumpClauseFirstUIP are applicable to the 
backjump states of the graph GTl]^ 

14 Experiments with Sup 

Here we present experimental analysis that compares performance of the system 
SUP versus CMODELS, clasp, SMODELS, and SMODELS cc- We start by describing the 
implementation details of SUP. 

The implementation of SUP utilizes 

• the interface of SAT-solver minisat (vl.l2b) that supports non-clausal con- 
straints described in (|Een and Sorensson 2003() in order to introduce addi- 
tional inference possibilities, but unit propagation. In particular, SUP im- 
plements Backchain True and All Rules Cancelled by means of non-clausal 
constraints and it uses the unit propagate of minisat to capture Unit Propa- 
gate LP and Backchain False. 

• parts of CMODELS code that eliminate weight and choice rules; perform model 
verification; and compute loop formulas. In particular, SUP uses the latter two 
parts of CMODELS code to capture Unfounded. 

In the experiments we used the following versions of the systems: SUP v. 0.1, 
SUP V. 0.2, CMODELS V. 3.77 using minisat v. 1.12b, clasp v. 1.0.5, SMODELS v. 2.32, 
SMODELScc V. 1.08 (implemented on top of SMODELS v. 2.26). System SUP (v. 0.1 
and V. 0.2) extends the implementation of minisat v. 1.12b. Therefore, we compare 
SUP performance against CMODELS that uses minisat 1.12b for its inference. Sys- 
tem SUP V. 0.1 stands for a version of SUP that implements Unit Propagate LP, All 
Rules Cancelled, and Backchain False A propagation rules, and does not implement 
Backchain True. System SUP v 0.2, on the other hand, also implements Backchain 
True. 

All considered solvers use preprocessor LPARSE (see Footnote 5) to ground the 
problems so that the systems are run on identical ground instances. Grounding time 
is not accounted for in solving time. All times are reported in seconds. Symbol tout 
stands for the fact that a system did not terminate with a solution after 10 minutes. 
Sup 0.1, Sup, Cm, CI, SiUcc, and Sm stand for SUP v. 0.1, SUP v. 0.2, CMODELS, 
CLASP, SMODELScc, and SMODELS respectively. The symbol —t abbreviates the flag 
— temp that allows SUP to forget learnt clauses due to loop formulas (by default SUP 
adds these clauses into permanent clause database). The symbol —a abbreviates 
the flag —atomreason that forces CMODELS, like SUP, to add only a clause implied 
by some loop formula and unsatisfied by a current model rather than the complete 
loop formula unsatisfied by the model. By default, CMODELS adds a complete loop 
formula unsatisfied by the model. All experiments were run on Intel(R) Pentium(R) 
D CPU 3.00GIIz, 2 cpu cores, cache size 1024 KB, running Linux. 

Table [1] presents the experiments run on tight programs. Recall that for tight 
programs (i) the transition rule Unfounded of SUP is never used for inference and 
(ii) the transition rule Test of CMODELS is never used for inference. 
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Instance 


Sup 0.1 


Sup 


Cm 


CI 


Smcc 


Sm 


Towers of Hanoi: http : / / asparagus 


. cs . uni-potsdam . de 




towers-hanoi.35_6 


43.68 


92.96 


74.03 


23.86 


115.83 


62.19 


towers-hanoi.36_6 


183.05 


70.71 


117.56 


38.98 


112.44 


86.01 


towers-hanoi. 3 7 Jo 


35.71 


30.75 


290.65 


34.25 


84.75 


120.53 


towers-hanoi.38_6 


243.41 


233.82 


37.03 


70.94 


99.87 


168.88 


towers-hanoi. 39_6 


96.59 


tout 


tout 


123.40 


384.24 


237.90 


towers-hanoi. 40_6 


tout 


113.15 


30.21 


114.39 


124.73 


329.08 


towers-hanoi. 41_6 


123.00 


69.51 


103.61 


169.43 


168.74 


466.15 


towers- hanoi.42_6 


tout 


389.88 


91.28 


182.10 


tout 


tout 


towers-hanoi. 43_6 


tout 


42.40 


353.74 


228.37 


204.89 


tout 


towers-hanoi. 44_6 


501.80 


438.78 


498.89 


tout 


tout 


tout 


Pigeon Holes with 10 holes: pgh#pig 


eons 








pgh7 


0.18 


0.22 


0.14 


0.10 


20.98 


1.67 


pghS 


0.75 


1.10 


0.68 


1.00 


tout 


8.87 


pgh9 


7.60 


9.30 


4.03 


5.73 


tout 


47.31 


Queens Normal Encoding: q.lp.#queens 


q.lp.18 


0.14 


0.14 


0.14 


0.08 


155.62 


7.84 


q.lp.22 


0.28 


0.27 


0.30 


0.14 


tout 


tout 


q.lp.24 


0.35 


0.35 


0.38 


0.20 


tout 


tout 


q.lp.30 


0.70 


0.71 


0.75 


0.44 


tout 


tout 


Queens Cardinality Constraint Encoding: q.lp2. ^queens 


q.lp2.18 


0.06 


0.05 


0.06 


0.04 


147.88 


2.24 


q.lp2.22 


0.11 


0.10 


0.12 


0.08 


tout 


191.76 


q.lp2.24 


0.15 


0.14 


0.17 


0.11 


tout 


267.27 


q.lp2.30 


0.30 


0.28 


0.32 


0.21 


tout 


tout 


TOAST: http: //as 


paragus 


. cs . uni 


-potsdam.de 






sequence3-ss3-Plain 


138.08 


49.69 


78.45 


10.51 


444.41 


tout 


sequence4-ss2-Plain 


14.86 


14.68 


24.50 


9.56 


99.85 


71.42 


sequence4-ss3-Plain 


408.99 


468.25 


tout 


294.27 


tout 


tout 


sequence3-ss2 


8.11 


9.00 


8.34 


5.75 


79.23 


46.07 


sequence3-ss3 


137.37 


49.84 


78.45 


10.39 


444.16 


tout 


sequence4-ss2 


16.13 


16.88 


13.38 


8.96 


102.98 


74.85 


sequcnce4-ss3 


103.33 


207.21 


16.74 


233.09 


tout 


tout 


Vertex Cover vcx.^ minimum size vertex cover: 






http : //www. cs . engr .uky . edu/ai /benchmarks .html 




vcl.53 


8.30 


3.23 


19.81 


tout 


tout 


218.59 


vc2.50 


11.55 


6.76 


13.32 


94.43 


tout 


11.73 


vc3.55 


0.65 


4.09 


64.44 


512.56 


tout 


10.59 


vc4.54 


6.49 


10.19 


23.05 


tout 


tout 


tout 



Table 1: Experiments: Tight problems. 



Table [2] presents the experiments run on nontight programs: 
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Instance 


Sup 


Sup -t 


Cm 


Cm -a 


CI 


Smcc 


Sm 


Deterministic Automaton: http : //www 


. f mi . uni-stuttgart . de/ 


szs/ 




research/projects/synthesis 


/benchinarks030923 . html 


mutcxSMorin 


15.30 


15.25 


15.68 


15.68 


15.45 


306.30 


153.60 


mutcx4IDFD 


0.80 


0.74 


0.66 


0.64 


0.80 


35.53 


13.46 


phiSMorin 


0.96 


0.95 


0.62 


0.72 


0.37 


2.50 


2.80 


phi4IDFD 


12.11 


14.85 


0.39 


2.98 


0.02 


0.31 


0.16 


phi4Morin 


tout 


448.82 


105.54 


tout 


95.50 


tout 


tout 


phiSIDFD 


tout 


tout 


67.14 


tout 


138.56 


tout 


tout 


Bounded Model Checking: 












http : //www. tcs 


.hut .f i/ '--^kepa/experiments/boundsmodels/ 




dpl0.i.O2.bl2 


30.88 


3.51 


10.14 


22.29 


0.20 


tout 


63.11 


dpl0.s.O2.b9 


0.54 


0.42 


0.78 


0.68 


0.14 


26.18 


13.04 


dpl2.i.02.bl4 


254.79 


88.18 


188.89 


272.55 


7.17 


tout 


tout 


dpl2.s.O2.bl0 


5.89 


2.01 


0.76 


2.05 


0.72 


tout 


337.28 


dp6.i.02.b8 


0.20 


0.20 


0.18 


0.16 


0.02 


0.93 


0.32 


dp8.i.O2.bl0 


0.98 


0.77 


1.53 


2.60 


0.03 


12.97 


4.12 


dp8.s.02.b8 


0.12 


0.20 


0.15 


0.10 


0.02 


2.58 


1.18 


Hamiltonian Cycle: http : //www . cs . engr . uky . e 


du/ ai/benchmarks . html 


hc_lS 


tout 


2.82 


tout 


tout 


tout 


tout 


tout 


hc_2S 


0.29 


5.37 


13.50 


8.60 


0.38 


153.44 


tout 


hc_3S 


1.28 


8.15 


5.94 


3.10 


tout 


tout 


tout 


hc_4S 


7.08 


2.81 


tout 


0.94 


2.18 


14.92 


tout 



Table 2: Experiments: Nontight problems. 



Overall the results demonstrated by SUP place the system in the class of highly 
efficient answer set solvers. 



15 Related Work 

Simons (I2000p and Ward (|2004p described the SMODELS and SMODELScc algorithms, 
respectively, by means of pseudocode and demonstrated their correctness. In this 
paper we designed an abstract framework that was used as an alternative method 
for describing these algorithms and demonstrating their correctness. 

Gebser and Schaub (|2006p provided a deductive system for describing inferences 
involved in computing answer sets by tableaux methods. The abstract framework 
presented here can be viewed as a deductive system also, but of a very different 
kind. First, it accounts for phenomena such as backjumping and learning (and also 
forgetting and restart) whereas the Gebser-Schaub system does not. Second, we 
describe backtracking by an inference rule, and the Gebser-Schaub system does 
not. Accordingly, the derivations considered in this paper describe search process, 
and derivations in the Gebser-Schaub system do not. Also, the abstract framework 
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discussed here does not have any inference rule similar to Cut; this is why its 
derivations are paths, rather than trees. 

16 Conclusions 

In this paper we showed how to model advanced algorithms for computing answer 
sets of a program by means of simple mathematical objects, graphs. We extended 
the abstract frameworks proposed in (jLierler 2008^ for describing native and SAT- 
based ASP algorithms to capture such sophisticated features as backjumping and 
learning. We characterized the algorithms of systems SMODELScc, SUP, and CMOD- 
ELS that implement these features. We note that the work on this abstract frame- 
work suggested the implementation of answer set solver SUP and the experimental 
analysis presented here demonstrates that SUP is a competitive representative in 
the family of answer set solvers. The abstract framework simplifies the analysis of 
the correctness of algorithms and allows us to study the relationship between vari- 
ous algorithms by analyzing the differences in strategies of choosing a path in the 
graph. For example, the description of the SMODELScc and SUP algorithms in this 
framework reflects their differences in a simple manner via distinct assignments of 
priorities to edges of the graph that characterize these systems. Also we used this 
framework to describe two algorithms for computing Decision and FirstUIP back- 
jump clauses for the implementation of conflict-driven backjumping and learning. 
This formalism provided the transparent means for specifying these algorithms. 
We believe that the development of this abstract framework powerful enough to 
describe advanced features of answer set solvers in a simple manner will promote 
the use of these sophisticated features in more solvers. This work helped us design 
the new solver SUP, and we hope that in the future it will suggest designs of other 
systems for computing answer sets. 
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